Dataset statistics
| Number of variables | 41 |
|---|---|
| Number of observations | 39717 |
| Missing cells | 30047 |
| Missing cells (%) | 1.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 12.4 MiB |
| Average record size in memory | 328.0 B |
Variable types
| Numeric | 22 |
|---|---|
| Categorical | 19 |
int_rate has a high cardinality: 371 distinct values | High cardinality |
emp_title has a high cardinality: 28820 distinct values | High cardinality |
issue_d has a high cardinality: 55 distinct values | High cardinality |
title has a high cardinality: 19615 distinct values | High cardinality |
earliest_cr_line has a high cardinality: 526 distinct values | High cardinality |
revol_util has a high cardinality: 1089 distinct values | High cardinality |
last_pymnt_d has a high cardinality: 101 distinct values | High cardinality |
last_credit_pull_d has a high cardinality: 106 distinct values | High cardinality |
loan_amnt is highly correlated with funded_amnt and 2 other fields | High correlation |
funded_amnt is highly correlated with loan_amnt and 3 other fields | High correlation |
funded_amnt_inv is highly correlated with loan_amnt and 3 other fields | High correlation |
installment is highly correlated with loan_amnt and 2 other fields | High correlation |
out_prncp is highly correlated with out_prncp_inv | High correlation |
out_prncp_inv is highly correlated with out_prncp | High correlation |
total_pymnt is highly correlated with funded_amnt and 2 other fields | High correlation |
total_pymnt_inv is highly correlated with funded_amnt_inv and 2 other fields | High correlation |
total_rec_prncp is highly correlated with total_pymnt and 1 other fields | High correlation |
sub_grade is highly correlated with grade | High correlation |
grade is highly correlated with sub_grade | High correlation |
emp_title has 2459 (6.2%) missing values | Missing |
emp_length has 1075 (2.7%) missing values | Missing |
mths_since_last_delinq has 25682 (64.7%) missing values | Missing |
pub_rec_bankruptcies has 697 (1.8%) missing values | Missing |
annual_inc is highly skewed (γ1 = 30.9491846) | Skewed |
collection_recovery_fee is highly skewed (γ1 = 25.02941842) | Skewed |
delinq_2yrs has 35405 (89.1%) zeros | Zeros |
inq_last_6mths has 19300 (48.6%) zeros | Zeros |
mths_since_last_delinq has 443 (1.1%) zeros | Zeros |
revol_bal has 994 (2.5%) zeros | Zeros |
out_prncp has 38577 (97.1%) zeros | Zeros |
out_prncp_inv has 38577 (97.1%) zeros | Zeros |
total_rec_late_fee has 37671 (94.8%) zeros | Zeros |
recoveries has 35499 (89.4%) zeros | Zeros |
collection_recovery_fee has 35935 (90.5%) zeros | Zeros |
Reproduction
| Analysis started | 2021-04-14 06:00:34.455497 |
|---|---|
| Analysis finished | 2021-04-14 06:02:53.133444 |
| Duration | 2 minutes and 18.68 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 885 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11219.44381 |
|---|---|
| Minimum | 500 |
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 2400 |
| Q1 | 5500 |
| median | 10000 |
| Q3 | 15000 |
| 95-th percentile | 25000 |
| Maximum | 35000 |
| Range | 34500 |
| Interquartile range (IQR) | 9500 |
Descriptive statistics
| Standard deviation | 7456.670694 |
|---|---|
| Coefficient of variation (CV) | 0.6646203517 |
| Kurtosis | 0.7686685518 |
| Mean | 11219.44381 |
| Median Absolute Deviation (MAD) | 5000 |
| Skewness | 1.05931729 |
| Sum | 445602650 |
| Variance | 55601937.84 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 2833 | 7.1% |
| 12000 | 2334 | 5.9% |
| 5000 | 2051 | 5.2% |
| 6000 | 1908 | 4.8% |
| 15000 | 1895 | 4.8% |
| 20000 | 1626 | 4.1% |
| 8000 | 1586 | 4.0% |
| 25000 | 1390 | 3.5% |
| 4000 | 1130 | 2.8% |
| 3000 | 1030 | 2.6% |
| Other values (875) | 21934 |
| Value | Count | Frequency (%) |
| 500 | 5 | |
| 700 | 1 | < 0.1% |
| 725 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 679 | |
| 34800 | 2 | < 0.1% |
| 34675 | 1 | < 0.1% |
| 34525 | 1 | < 0.1% |
| 34475 | 5 | < 0.1% |
| Distinct | 1041 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10947.7132 |
|---|---|
| Minimum | 500 |
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 500 |
|---|---|
| 5-th percentile | 2400 |
| Q1 | 5400 |
| median | 9600 |
| Q3 | 15000 |
| 95-th percentile | 25000 |
| Maximum | 35000 |
| Range | 34500 |
| Interquartile range (IQR) | 9600 |
Descriptive statistics
| Standard deviation | 7187.23867 |
|---|---|
| Coefficient of variation (CV) | 0.6565059334 |
| Kurtosis | 0.9375519943 |
| Mean | 10947.7132 |
| Median Absolute Deviation (MAD) | 4600 |
| Skewness | 1.081710238 |
| Sum | 434810325 |
| Variance | 51656399.7 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 2741 | 6.9% |
| 12000 | 2244 | 5.6% |
| 5000 | 2040 | 5.1% |
| 6000 | 1898 | 4.8% |
| 15000 | 1784 | 4.5% |
| 8000 | 1573 | 4.0% |
| 20000 | 1456 | 3.7% |
| 25000 | 1133 | 2.9% |
| 4000 | 1127 | 2.8% |
| 3000 | 1022 | 2.6% |
| Other values (1031) | 22699 |
| Value | Count | Frequency (%) |
| 500 | 5 | |
| 700 | 1 | < 0.1% |
| 725 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 554 | |
| 34800 | 1 | < 0.1% |
| 34675 | 2 | < 0.1% |
| 34525 | 1 | < 0.1% |
| 34475 | 4 | < 0.1% |
| Distinct | 8205 |
|---|---|
| Distinct (%) | 20.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10397.44887 |
|---|---|
| Minimum | 0 |
| Maximum | 35000 |
| Zeros | 129 |
| Zeros (%) | 0.3% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1873.658 |
| Q1 | 5000 |
| median | 8975 |
| Q3 | 14400 |
| 95-th percentile | 24736.57226 |
| Maximum | 35000 |
| Range | 35000 |
| Interquartile range (IQR) | 9400 |
Descriptive statistics
| Standard deviation | 7128.450439 |
|---|---|
| Coefficient of variation (CV) | 0.6855961044 |
| Kurtosis | 1.062544362 |
| Mean | 10397.44887 |
| Median Absolute Deviation (MAD) | 4200 |
| Skewness | 1.106212938 |
| Sum | 412955476.7 |
| Variance | 50814805.66 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 1309 | 3.3% |
| 10000 | 1275 | 3.2% |
| 6000 | 1200 | 3.0% |
| 12000 | 1069 | 2.7% |
| 8000 | 900 | 2.3% |
| 4000 | 812 | 2.0% |
| 3000 | 803 | 2.0% |
| 15000 | 657 | 1.7% |
| 7000 | 600 | 1.5% |
| 2000 | 452 | 1.1% |
| Other values (8195) | 30640 |
| Value | Count | Frequency (%) |
| 0 | 129 | |
| 0.000121098 | 1 | < 0.1% |
| 0.000531133 | 1 | < 0.1% |
| 0.000654607 | 1 | < 0.1% |
| 0.001867696 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 135 | |
| 34997.35245 | 1 | < 0.1% |
| 34993.65539 | 1 | < 0.1% |
| 34993.32571 | 1 | < 0.1% |
| 34993.26306 | 1 | < 0.1% |
term
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| 36 months | |
|---|---|
| 60 months |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 397170 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 36 months |
|---|---|
| 2nd row | 60 months |
| 3rd row | 36 months |
| 4th row | 36 months |
| 5th row | 60 months |
| Value | Count | Frequency (%) |
| 36 months | 29096 | |
| 60 months | 10621 | 26.7% |
| Value | Count | Frequency (%) |
| months | 39717 | |
| 36 | 29096 | |
| 60 | 10621 | 13.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 79434 | ||
| 6 | 39717 | |
| m | 39717 | |
| o | 39717 | |
| n | 39717 | |
| t | 39717 | |
| h | 39717 | |
| s | 39717 | |
| 3 | 29096 | 7.3% |
| 0 | 10621 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 238302 | |
| Space Separator | 79434 | 20.0% |
| Decimal Number | 79434 | 20.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| m | 39717 | |
| o | 39717 | |
| n | 39717 | |
| t | 39717 | |
| h | 39717 | |
| s | 39717 |
| Value | Count | Frequency (%) |
| 6 | 39717 | |
| 3 | 29096 | |
| 0 | 10621 | 13.4% |
| Value | Count | Frequency (%) |
| 79434 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 238302 | |
| Common | 158868 |
Most frequent character per script
| Value | Count | Frequency (%) |
| m | 39717 | |
| o | 39717 | |
| n | 39717 | |
| t | 39717 | |
| h | 39717 | |
| s | 39717 |
| Value | Count | Frequency (%) |
| 79434 | ||
| 6 | 39717 | |
| 3 | 29096 | 18.3% |
| 0 | 10621 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 397170 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 79434 | ||
| 6 | 39717 | |
| m | 39717 | |
| o | 39717 | |
| n | 39717 | |
| t | 39717 | |
| h | 39717 | |
| s | 39717 | |
| 3 | 29096 | 7.3% |
| 0 | 10621 | 2.7% |
| Distinct | 371 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| 10.99% | 956 |
|---|---|
| 13.49% | 826 |
| 11.49% | 825 |
| 7.51% | 787 |
| 7.88% | 725 |
| Other values (366) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.694287081 |
| Min length | 5 |
Characters and Unicode
| Total characters | 226160 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 10.65% |
|---|---|
| 2nd row | 15.27% |
| 3rd row | 15.96% |
| 4th row | 13.49% |
| 5th row | 12.69% |
| Value | Count | Frequency (%) |
| 10.99% | 956 | 2.4% |
| 13.49% | 826 | 2.1% |
| 11.49% | 825 | 2.1% |
| 7.51% | 787 | 2.0% |
| 7.88% | 725 | 1.8% |
| 7.49% | 656 | 1.7% |
| 11.71% | 607 | 1.5% |
| 9.99% | 603 | 1.5% |
| 7.90% | 582 | 1.5% |
| 5.42% | 573 | 1.4% |
| Other values (361) | 32577 |
| Value | Count | Frequency (%) |
| 10.99 | 956 | 2.4% |
| 13.49 | 826 | 2.1% |
| 11.49 | 825 | 2.1% |
| 7.51 | 787 | 2.0% |
| 7.88 | 725 | 1.8% |
| 7.49 | 656 | 1.7% |
| 11.71 | 607 | 1.5% |
| 9.99 | 603 | 1.5% |
| 7.90 | 582 | 1.5% |
| 5.42 | 573 | 1.4% |
| Other values (361) | 32577 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 39717 | |
| % | 39717 | |
| 1 | 38195 | |
| 9 | 21893 | |
| 2 | 12734 | 5.6% |
| 7 | 12132 | 5.4% |
| 6 | 12033 | 5.3% |
| 4 | 11091 | 4.9% |
| 5 | 9947 | 4.4% |
| 3 | 9929 | 4.4% |
| Other values (2) | 18772 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 146726 | |
| Other Punctuation | 79434 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 38195 | |
| 9 | 21893 | |
| 2 | 12734 | 8.7% |
| 7 | 12132 | 8.3% |
| 6 | 12033 | 8.2% |
| 4 | 11091 | 7.6% |
| 5 | 9947 | 6.8% |
| 3 | 9929 | 6.8% |
| 8 | 9527 | 6.5% |
| 0 | 9245 | 6.3% |
| Value | Count | Frequency (%) |
| . | 39717 | |
| % | 39717 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 226160 |
Most frequent character per script
| Value | Count | Frequency (%) |
| . | 39717 | |
| % | 39717 | |
| 1 | 38195 | |
| 9 | 21893 | |
| 2 | 12734 | 5.6% |
| 7 | 12132 | 5.4% |
| 6 | 12033 | 5.3% |
| 4 | 11091 | 4.9% |
| 5 | 9947 | 4.4% |
| 3 | 9929 | 4.4% |
| Other values (2) | 18772 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 226160 |
Most frequent character per block
| Value | Count | Frequency (%) |
| . | 39717 | |
| % | 39717 | |
| 1 | 38195 | |
| 9 | 21893 | |
| 2 | 12734 | 5.6% |
| 7 | 12132 | 5.4% |
| 6 | 12033 | 5.3% |
| 4 | 11091 | 4.9% |
| 5 | 9947 | 4.4% |
| 3 | 9929 | 4.4% |
| Other values (2) | 18772 |
| Distinct | 15383 |
|---|---|
| Distinct (%) | 38.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 324.5619221 |
|---|---|
| Minimum | 15.69 |
| Maximum | 1305.19 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 15.69 |
|---|---|
| 5-th percentile | 71.246 |
| Q1 | 167.02 |
| median | 280.22 |
| Q3 | 430.78 |
| 95-th percentile | 762.996 |
| Maximum | 1305.19 |
| Range | 1289.5 |
| Interquartile range (IQR) | 263.76 |
Descriptive statistics
| Standard deviation | 208.8748735 |
|---|---|
| Coefficient of variation (CV) | 0.6435593929 |
| Kurtosis | 1.246801303 |
| Mean | 324.5619221 |
| Median Absolute Deviation (MAD) | 123.2 |
| Skewness | 1.128419095 |
| Sum | 12890625.86 |
| Variance | 43628.71279 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 311.11 | 68 | 0.2% |
| 180.96 | 59 | 0.1% |
| 311.02 | 54 | 0.1% |
| 150.8 | 48 | 0.1% |
| 368.45 | 46 | 0.1% |
| 372.12 | 45 | 0.1% |
| 330.76 | 43 | 0.1% |
| 339.31 | 42 | 0.1% |
| 301.6 | 41 | 0.1% |
| 317.72 | 41 | 0.1% |
| Other values (15373) | 39230 |
| Value | Count | Frequency (%) |
| 15.69 | 1 | |
| 16.08 | 1 | |
| 16.25 | 1 | |
| 16.31 | 1 | |
| 16.47 | 1 |
| Value | Count | Frequency (%) |
| 1305.19 | 1 | |
| 1302.69 | 1 | |
| 1295.21 | 1 | |
| 1288.1 | 2 | |
| 1283.5 | 1 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| B | |
|---|---|
| A | |
| C | |
| D | |
| E | |
| Other values (2) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 39717 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | C |
| 3rd row | C |
| 4th row | C |
| 5th row | B |
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
| Value | Count | Frequency (%) |
| b | 12020 | |
| a | 10085 | |
| c | 8098 | |
| d | 5307 | |
| e | 2842 | 7.2% |
| f | 1049 | 2.6% |
| g | 316 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 39717 |
Most frequent character per category
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39717 |
Most frequent character per script
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39717 |
Most frequent character per block
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
| Distinct | 35 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| B3 | |
|---|---|
| A4 | |
| A5 | |
| B5 | |
| B4 | 2512 |
| Other values (30) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 79434 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B2 |
|---|---|
| 2nd row | C4 |
| 3rd row | C5 |
| 4th row | C1 |
| 5th row | B5 |
| Value | Count | Frequency (%) |
| B3 | 2917 | 7.3% |
| A4 | 2886 | 7.3% |
| A5 | 2742 | 6.9% |
| B5 | 2704 | 6.8% |
| B4 | 2512 | 6.3% |
| C1 | 2136 | 5.4% |
| B2 | 2057 | 5.2% |
| C2 | 2011 | 5.1% |
| B1 | 1830 | 4.6% |
| A3 | 1810 | 4.6% |
| Other values (25) | 16112 |
| Value | Count | Frequency (%) |
| b3 | 2917 | 7.3% |
| a4 | 2886 | 7.3% |
| a5 | 2742 | 6.9% |
| b5 | 2704 | 6.8% |
| b4 | 2512 | 6.3% |
| c1 | 2136 | 5.4% |
| b2 | 2057 | 5.2% |
| c2 | 2011 | 5.1% |
| b1 | 1830 | 4.6% |
| a3 | 1810 | 4.6% |
| Other values (25) | 16112 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| 4 | 8293 | |
| 3 | 8215 | |
| C | 8098 | |
| 5 | 8070 | |
| 2 | 7907 | |
| 1 | 7232 | |
| D | 5307 | |
| E | 2842 | 3.6% |
| Other values (2) | 1365 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 39717 | |
| Decimal Number | 39717 |
Most frequent character per category
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
| Value | Count | Frequency (%) |
| 4 | 8293 | |
| 3 | 8215 | |
| 5 | 8070 | |
| 2 | 7907 | |
| 1 | 7232 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39717 | |
| Common | 39717 |
Most frequent character per script
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| C | 8098 | |
| D | 5307 | |
| E | 2842 | 7.2% |
| F | 1049 | 2.6% |
| G | 316 | 0.8% |
| Value | Count | Frequency (%) |
| 4 | 8293 | |
| 3 | 8215 | |
| 5 | 8070 | |
| 2 | 7907 | |
| 1 | 7232 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79434 |
Most frequent character per block
| Value | Count | Frequency (%) |
| B | 12020 | |
| A | 10085 | |
| 4 | 8293 | |
| 3 | 8215 | |
| C | 8098 | |
| 5 | 8070 | |
| 2 | 7907 | |
| 1 | 7232 | |
| D | 5307 | |
| E | 2842 | 3.6% |
| Other values (2) | 1365 | 1.7% |
| Distinct | 28820 |
|---|---|
| Distinct (%) | 77.4% |
| Missing | 2459 |
| Missing (%) | 6.2% |
| Memory size | 310.4 KiB |
| US Army | 134 |
|---|---|
| Bank of America | 109 |
| IBM | 66 |
| AT&T | 59 |
| Kaiser Permanente | 56 |
| Other values (28815) |
Length
| Max length | 78 |
|---|---|
| Median length | 18 |
| Mean length | 18.37978421 |
| Min length | 2 |
Characters and Unicode
| Total characters | 684794 |
|---|---|
| Distinct characters | 96 |
| Distinct categories | 15 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 25641 ? |
|---|---|
| Unique (%) | 68.8% |
Sample
| 1st row | Ryder |
|---|---|
| 2nd row | AIR RESOURCES BOARD |
| 3rd row | University Medical Group |
| 4th row | Veolia Transportaton |
| 5th row | Southern Star Photography |
| Value | Count | Frequency (%) |
| US Army | 134 | 0.3% |
| Bank of America | 109 | 0.3% |
| IBM | 66 | 0.2% |
| AT&T | 59 | 0.1% |
| Kaiser Permanente | 56 | 0.1% |
| Wells Fargo | 54 | 0.1% |
| USAF | 54 | 0.1% |
| UPS | 53 | 0.1% |
| US Air Force | 52 | 0.1% |
| Walmart | 45 | 0.1% |
| Other values (28810) | 36576 | |
| (Missing) | 2459 | 6.2% |
| Value | Count | Frequency (%) |
| inc | 3197 | 3.2% |
| of | 3008 | 3.0% |
| 1208 | 1.2% | |
| and | 963 | 1.0% |
| center | 818 | 0.8% |
| bank | 805 | 0.8% |
| county | 803 | 0.8% |
| services | 795 | 0.8% |
| school | 750 | 0.7% |
| the | 747 | 0.7% |
| Other values (18882) | 87491 |
Most occurring characters
| Value | Count | Frequency (%) |
| 64766 | 9.5% | |
| e | 55954 | 8.2% |
| a | 43836 | 6.4% |
| n | 42641 | 6.2% |
| o | 42586 | 6.2% |
| i | 40491 | 5.9% |
| r | 40067 | 5.9% |
| t | 38580 | 5.6% |
| s | 30254 | 4.4% |
| l | 25923 | 3.8% |
| Other values (86) | 259696 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 489338 | |
| Uppercase Letter | 119545 | 17.5% |
| Space Separator | 64766 | 9.5% |
| Other Punctuation | 8798 | 1.3% |
| Dash Punctuation | 1031 | 0.2% |
| Decimal Number | 968 | 0.1% |
| Open Punctuation | 159 | < 0.1% |
| Close Punctuation | 156 | < 0.1% |
| Math Symbol | 21 | < 0.1% |
| Other Symbol | 2 | < 0.1% |
| Other values (5) | 10 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| C | 14579 | 12.2% |
| S | 13325 | 11.1% |
| A | 8885 | 7.4% |
| I | 7566 | 6.3% |
| M | 6518 | 5.5% |
| P | 6077 | 5.1% |
| T | 5691 | 4.8% |
| L | 5561 | 4.7% |
| E | 5241 | 4.4% |
| D | 5056 | 4.2% |
| Other values (18) | 41046 |
| Value | Count | Frequency (%) |
| e | 55954 | |
| a | 43836 | |
| n | 42641 | |
| o | 42586 | |
| i | 40491 | 8.3% |
| r | 40067 | 8.2% |
| t | 38580 | 7.9% |
| s | 30254 | 6.2% |
| l | 25923 | 5.3% |
| c | 23099 | 4.7% |
| Other values (17) | 105907 |
| Value | Count | Frequency (%) |
| . | 4253 | |
| , | 2194 | |
| & | 1301 | 14.8% |
| ' | 652 | 7.4% |
| / | 311 | 3.5% |
| # | 36 | 0.4% |
| @ | 10 | 0.1% |
| : | 9 | 0.1% |
| " | 8 | 0.1% |
| ! | 8 | 0.1% |
| Other values (5) | 16 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 192 | |
| 2 | 161 | |
| 3 | 155 | |
| 0 | 98 | |
| 4 | 91 | |
| 5 | 72 | 7.4% |
| 9 | 62 | 6.4% |
| 6 | 58 | 6.0% |
| 7 | 46 | 4.8% |
| 8 | 33 | 3.4% |
| Value | Count | Frequency (%) |
| + | 18 | |
| | | 2 | 9.5% |
| < | 1 | 4.8% |
| Value | Count | Frequency (%) |
| ( | 158 | |
| [ | 1 | 0.6% |
| Value | Count | Frequency (%) |
| | 1 | |
| | 1 |
| Value | Count | Frequency (%) |
| ¢ | 1 | |
| $ | 1 |
| Value | Count | Frequency (%) |
| 64766 |
| Value | Count | Frequency (%) |
| - | 1031 |
| Value | Count | Frequency (%) |
| ) | 156 |
| Value | Count | Frequency (%) |
| © | 2 |
| Value | Count | Frequency (%) |
| ` | 2 |
| Value | Count | Frequency (%) |
| _ | 2 |
| Value | Count | Frequency (%) |
| ² | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 608883 | |
| Common | 75911 | 11.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 55954 | 9.2% |
| a | 43836 | 7.2% |
| n | 42641 | 7.0% |
| o | 42586 | 7.0% |
| i | 40491 | 6.7% |
| r | 40067 | 6.6% |
| t | 38580 | 6.3% |
| s | 30254 | 5.0% |
| l | 25923 | 4.3% |
| c | 23099 | 3.8% |
| Other values (45) | 225452 |
| Value | Count | Frequency (%) |
| 64766 | ||
| . | 4253 | 5.6% |
| , | 2194 | 2.9% |
| & | 1301 | 1.7% |
| - | 1031 | 1.4% |
| ' | 652 | 0.9% |
| / | 311 | 0.4% |
| 1 | 192 | 0.3% |
| 2 | 161 | 0.2% |
| ( | 158 | 0.2% |
| Other values (31) | 892 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 684780 | |
| None | 14 | < 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| 64766 | 9.5% | |
| e | 55954 | 8.2% |
| a | 43836 | 6.4% |
| n | 42641 | 6.2% |
| o | 42586 | 6.2% |
| i | 40491 | 5.9% |
| r | 40067 | 5.9% |
| t | 38580 | 5.6% |
| s | 30254 | 4.4% |
| l | 25923 | 3.8% |
| Other values (77) | 259682 |
| Value | Count | Frequency (%) |
| Ã | 3 | |
| © | 2 | |
| Â | 2 | |
| ² | 2 | |
| â | 1 | 7.1% |
| | 1 | 7.1% |
| ¢ | 1 | 7.1% |
| | 1 | 7.1% |
| ¡ | 1 | 7.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1075 |
| Missing (%) | 2.7% |
| Memory size | 310.4 KiB |
| 10+ years | |
|---|---|
| < 1 year | |
| 2 years | |
| 3 years | |
| 4 years | |
| Other values (6) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.494306713 |
| Min length | 6 |
Characters and Unicode
| Total characters | 289595 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10+ years |
|---|---|
| 2nd row | < 1 year |
| 3rd row | 10+ years |
| 4th row | 10+ years |
| 5th row | 1 year |
| Value | Count | Frequency (%) |
| 10+ years | 8879 | |
| < 1 year | 4583 | |
| 2 years | 4388 | |
| 3 years | 4095 | |
| 4 years | 3436 | 8.7% |
| 5 years | 3282 | 8.3% |
| 1 year | 3240 | 8.2% |
| 6 years | 2229 | 5.6% |
| 7 years | 1773 | 4.5% |
| 8 years | 1479 | 3.7% |
| Value | Count | Frequency (%) |
| years | 30819 | |
| 10 | 8879 | 10.8% |
| 1 | 7823 | 9.6% |
| year | 7823 | 9.6% |
| 4583 | 5.6% | |
| 2 | 4388 | 5.4% |
| 3 | 4095 | 5.0% |
| 4 | 3436 | 4.2% |
| 5 | 3282 | 4.0% |
| 6 | 2229 | 2.7% |
| Other values (3) | 4510 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 43225 | ||
| y | 38642 | |
| e | 38642 | |
| a | 38642 | |
| r | 38642 | |
| s | 30819 | |
| 1 | 16702 | 5.8% |
| 0 | 8879 | 3.1% |
| + | 8879 | 3.1% |
| < | 4583 | 1.6% |
| Other values (8) | 21940 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 185387 | |
| Decimal Number | 47521 | 16.4% |
| Space Separator | 43225 | 14.9% |
| Math Symbol | 13462 | 4.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 16702 | |
| 0 | 8879 | |
| 2 | 4388 | 9.2% |
| 3 | 4095 | 8.6% |
| 4 | 3436 | 7.2% |
| 5 | 3282 | 6.9% |
| 6 | 2229 | 4.7% |
| 7 | 1773 | 3.7% |
| 8 | 1479 | 3.1% |
| 9 | 1258 | 2.6% |
| Value | Count | Frequency (%) |
| y | 38642 | |
| e | 38642 | |
| a | 38642 | |
| r | 38642 | |
| s | 30819 |
| Value | Count | Frequency (%) |
| + | 8879 | |
| < | 4583 |
| Value | Count | Frequency (%) |
| 43225 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 185387 | |
| Common | 104208 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 43225 | ||
| 1 | 16702 | 16.0% |
| 0 | 8879 | 8.5% |
| + | 8879 | 8.5% |
| < | 4583 | 4.4% |
| 2 | 4388 | 4.2% |
| 3 | 4095 | 3.9% |
| 4 | 3436 | 3.3% |
| 5 | 3282 | 3.1% |
| 6 | 2229 | 2.1% |
| Other values (3) | 4510 | 4.3% |
| Value | Count | Frequency (%) |
| y | 38642 | |
| e | 38642 | |
| a | 38642 | |
| r | 38642 | |
| s | 30819 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 289595 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 43225 | ||
| y | 38642 | |
| e | 38642 | |
| a | 38642 | |
| r | 38642 | |
| s | 30819 | |
| 1 | 16702 | 5.8% |
| 0 | 8879 | 3.1% |
| + | 8879 | 3.1% |
| < | 4583 | 1.6% |
| Other values (8) | 21940 |
home_ownership
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| RENT | |
|---|---|
| MORTGAGE | |
| OWN | |
| OTHER | 98 |
| NONE | 3 |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 5.703955485 |
| Min length | 3 |
Characters and Unicode
| Total characters | 226544 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RENT |
|---|---|
| 2nd row | RENT |
| 3rd row | RENT |
| 4th row | RENT |
| 5th row | RENT |
| Value | Count | Frequency (%) |
| RENT | 18899 | |
| MORTGAGE | 17659 | |
| OWN | 3058 | 7.7% |
| OTHER | 98 | 0.2% |
| NONE | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| rent | 18899 | |
| mortgage | 17659 | |
| own | 3058 | 7.7% |
| other | 98 | 0.2% |
| none | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 36659 | |
| R | 36656 | |
| T | 36656 | |
| G | 35318 | |
| N | 21963 | |
| O | 20818 | |
| M | 17659 | |
| A | 17659 | |
| W | 3058 | 1.3% |
| H | 98 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 226544 |
Most frequent character per category
| Value | Count | Frequency (%) |
| E | 36659 | |
| R | 36656 | |
| T | 36656 | |
| G | 35318 | |
| N | 21963 | |
| O | 20818 | |
| M | 17659 | |
| A | 17659 | |
| W | 3058 | 1.3% |
| H | 98 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 226544 |
Most frequent character per script
| Value | Count | Frequency (%) |
| E | 36659 | |
| R | 36656 | |
| T | 36656 | |
| G | 35318 | |
| N | 21963 | |
| O | 20818 | |
| M | 17659 | |
| A | 17659 | |
| W | 3058 | 1.3% |
| H | 98 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 226544 |
Most frequent character per block
| Value | Count | Frequency (%) |
| E | 36659 | |
| R | 36656 | |
| T | 36656 | |
| G | 35318 | |
| N | 21963 | |
| O | 20818 | |
| M | 17659 | |
| A | 17659 | |
| W | 3058 | 1.3% |
| H | 98 | < 0.1% |
| Distinct | 5318 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68968.92638 |
|---|---|
| Minimum | 4000 |
| Maximum | 6000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 4000 |
|---|---|
| 5-th percentile | 24000 |
| Q1 | 40404 |
| median | 59000 |
| Q3 | 82300 |
| 95-th percentile | 142000 |
| Maximum | 6000000 |
| Range | 5996000 |
| Interquartile range (IQR) | 41896 |
Descriptive statistics
| Standard deviation | 63793.76579 |
|---|---|
| Coefficient of variation (CV) | 0.9249638807 |
| Kurtosis | 2302.737777 |
| Mean | 68968.92638 |
| Median Absolute Deviation (MAD) | 20000 |
| Skewness | 30.9491846 |
| Sum | 2739238849 |
| Variance | 4069644554 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 60000 | 1505 | 3.8% |
| 50000 | 1057 | 2.7% |
| 40000 | 876 | 2.2% |
| 45000 | 830 | 2.1% |
| 30000 | 825 | 2.1% |
| 75000 | 811 | 2.0% |
| 65000 | 803 | 2.0% |
| 70000 | 733 | 1.8% |
| 48000 | 723 | 1.8% |
| 80000 | 662 | 1.7% |
| Other values (5308) | 30892 |
| Value | Count | Frequency (%) |
| 4000 | 1 | < 0.1% |
| 4080 | 1 | < 0.1% |
| 4200 | 2 | |
| 4800 | 4 | |
| 4888 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6000000 | 1 | |
| 3900000 | 1 | |
| 2039784 | 1 | |
| 1900000 | 1 | |
| 1782000 | 1 |
verification_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| Not Verified | |
|---|---|
| Verified | |
| Source Verified |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 11.46433517 |
| Min length | 8 |
Characters and Unicode
| Total characters | 455329 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Verified |
|---|---|
| 2nd row | Source Verified |
| 3rd row | Not Verified |
| 4th row | Source Verified |
| 5th row | Source Verified |
| Value | Count | Frequency (%) |
| Not Verified | 16921 | |
| Verified | 12809 | |
| Source Verified | 9987 |
| Value | Count | Frequency (%) |
| verified | 39717 | |
| not | 16921 | |
| source | 9987 | 15.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 89421 | |
| i | 79434 | |
| r | 49704 | |
| V | 39717 | |
| f | 39717 | |
| d | 39717 | |
| o | 26908 | 5.9% |
| 26908 | 5.9% | |
| N | 16921 | 3.7% |
| t | 16921 | 3.7% |
| Other values (3) | 29961 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 361796 | |
| Uppercase Letter | 66625 | 14.6% |
| Space Separator | 26908 | 5.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 89421 | |
| i | 79434 | |
| r | 49704 | |
| f | 39717 | |
| d | 39717 | |
| o | 26908 | 7.4% |
| t | 16921 | 4.7% |
| u | 9987 | 2.8% |
| c | 9987 | 2.8% |
| Value | Count | Frequency (%) |
| V | 39717 | |
| N | 16921 | |
| S | 9987 | 15.0% |
| Value | Count | Frequency (%) |
| 26908 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 428421 | |
| Common | 26908 | 5.9% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 89421 | |
| i | 79434 | |
| r | 49704 | |
| V | 39717 | |
| f | 39717 | |
| d | 39717 | |
| o | 26908 | 6.3% |
| N | 16921 | 3.9% |
| t | 16921 | 3.9% |
| S | 9987 | 2.3% |
| Other values (2) | 19974 | 4.7% |
| Value | Count | Frequency (%) |
| 26908 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 455329 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 89421 | |
| i | 79434 | |
| r | 49704 | |
| V | 39717 | |
| f | 39717 | |
| d | 39717 | |
| o | 26908 | 5.9% |
| 26908 | 5.9% | |
| N | 16921 | 3.7% |
| t | 16921 | 3.7% |
| Other values (3) | 29961 | 6.6% |
| Distinct | 55 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| Dec-11 | 2260 |
|---|---|
| Nov-11 | 2223 |
| Oct-11 | 2114 |
| Sep-11 | 2063 |
| Aug-11 | 1928 |
| Other values (50) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 238302 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Dec-11 |
|---|---|
| 2nd row | Dec-11 |
| 3rd row | Dec-11 |
| 4th row | Dec-11 |
| 5th row | Dec-11 |
| Value | Count | Frequency (%) |
| Dec-11 | 2260 | 5.7% |
| Nov-11 | 2223 | 5.6% |
| Oct-11 | 2114 | 5.3% |
| Sep-11 | 2063 | 5.2% |
| Aug-11 | 1928 | 4.9% |
| Jul-11 | 1870 | 4.7% |
| Jun-11 | 1827 | 4.6% |
| May-11 | 1689 | 4.3% |
| Apr-11 | 1562 | 3.9% |
| Mar-11 | 1443 | 3.6% |
| Other values (45) | 20738 |
| Value | Count | Frequency (%) |
| dec-11 | 2260 | 5.7% |
| nov-11 | 2223 | 5.6% |
| oct-11 | 2114 | 5.3% |
| sep-11 | 2063 | 5.2% |
| aug-11 | 1928 | 4.9% |
| jul-11 | 1870 | 4.7% |
| jun-11 | 1827 | 4.6% |
| may-11 | 1689 | 4.3% |
| apr-11 | 1562 | 3.9% |
| mar-11 | 1443 | 3.6% |
| Other values (45) | 20738 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 54844 | |
| - | 39717 | |
| 0 | 18061 | 7.6% |
| e | 10439 | 4.4% |
| u | 10273 | 4.3% |
| J | 9134 | 3.8% |
| c | 8367 | 3.5% |
| a | 8070 | 3.4% |
| p | 6482 | 2.7% |
| A | 6352 | 2.7% |
| Other values (18) | 66563 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 79434 | |
| Decimal Number | 79434 | |
| Uppercase Letter | 39717 | |
| Dash Punctuation | 39717 |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 10439 | |
| u | 10273 | |
| c | 8367 | |
| a | 8070 | |
| p | 6482 | |
| n | 5658 | |
| r | 5526 | |
| o | 4167 | 5.2% |
| v | 4167 | 5.2% |
| t | 3934 | 5.0% |
| Other values (4) | 12351 |
| Value | Count | Frequency (%) |
| J | 9134 | |
| A | 6352 | |
| M | 5691 | |
| D | 4433 | |
| N | 4167 | |
| O | 3934 | |
| S | 3648 | 9.2% |
| F | 2358 | 5.9% |
| Value | Count | Frequency (%) |
| 1 | 54844 | |
| 0 | 18061 | 22.7% |
| 9 | 4716 | 5.9% |
| 8 | 1562 | 2.0% |
| 7 | 251 | 0.3% |
| Value | Count | Frequency (%) |
| - | 39717 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 119151 | |
| Common | 119151 |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 10439 | 8.8% |
| u | 10273 | 8.6% |
| J | 9134 | 7.7% |
| c | 8367 | 7.0% |
| a | 8070 | 6.8% |
| p | 6482 | 5.4% |
| A | 6352 | 5.3% |
| M | 5691 | 4.8% |
| n | 5658 | 4.7% |
| r | 5526 | 4.6% |
| Other values (12) | 43159 |
| Value | Count | Frequency (%) |
| 1 | 54844 | |
| - | 39717 | |
| 0 | 18061 | 15.2% |
| 9 | 4716 | 4.0% |
| 8 | 1562 | 1.3% |
| 7 | 251 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 238302 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 54844 | |
| - | 39717 | |
| 0 | 18061 | 7.6% |
| e | 10439 | 4.4% |
| u | 10273 | 4.3% |
| J | 9134 | 3.8% |
| c | 8367 | 3.5% |
| a | 8070 | 3.4% |
| p | 6482 | 2.7% |
| A | 6352 | 2.7% |
| Other values (18) | 66563 |
loan_status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| Fully Paid | |
|---|---|
| Charged Off | |
| Current | 1140 |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 10.05556814 |
| Min length | 7 |
Characters and Unicode
| Total characters | 399377 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fully Paid |
|---|---|
| 2nd row | Charged Off |
| 3rd row | Fully Paid |
| 4th row | Fully Paid |
| 5th row | Current |
| Value | Count | Frequency (%) |
| Fully Paid | 32950 | |
| Charged Off | 5627 | 14.2% |
| Current | 1140 | 2.9% |
| Value | Count | Frequency (%) |
| fully | 32950 | |
| paid | 32950 | |
| off | 5627 | 7.2% |
| charged | 5627 | 7.2% |
| current | 1140 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 65900 | |
| 38577 | ||
| a | 38577 | |
| d | 38577 | |
| u | 34090 | |
| F | 32950 | |
| y | 32950 | |
| P | 32950 | |
| i | 32950 | |
| f | 11254 | 2.8% |
| Other values (8) | 40602 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 282506 | |
| Uppercase Letter | 78294 | 19.6% |
| Space Separator | 38577 | 9.7% |
Most frequent character per category
| Value | Count | Frequency (%) |
| l | 65900 | |
| a | 38577 | |
| d | 38577 | |
| u | 34090 | |
| y | 32950 | |
| i | 32950 | |
| f | 11254 | 4.0% |
| r | 7907 | 2.8% |
| e | 6767 | 2.4% |
| h | 5627 | 2.0% |
| Other values (3) | 7907 | 2.8% |
| Value | Count | Frequency (%) |
| F | 32950 | |
| P | 32950 | |
| C | 6767 | 8.6% |
| O | 5627 | 7.2% |
| Value | Count | Frequency (%) |
| 38577 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 360800 | |
| Common | 38577 | 9.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| l | 65900 | |
| a | 38577 | |
| d | 38577 | |
| u | 34090 | |
| F | 32950 | |
| y | 32950 | |
| P | 32950 | |
| i | 32950 | |
| f | 11254 | 3.1% |
| r | 7907 | 2.2% |
| Other values (7) | 32695 |
| Value | Count | Frequency (%) |
| 38577 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 399377 |
Most frequent character per block
| Value | Count | Frequency (%) |
| l | 65900 | |
| 38577 | ||
| a | 38577 | |
| d | 38577 | |
| u | 34090 | |
| F | 32950 | |
| y | 32950 | |
| P | 32950 | |
| i | 32950 | |
| f | 11254 | 2.8% |
| Other values (8) | 40602 |
purpose
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| debt_consolidation | |
|---|---|
| credit_card | |
| other | |
| home_improvement | |
| major_purchase | |
| Other values (9) |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 13.7361835 |
| Min length | 3 |
Characters and Unicode
| Total characters | 545560 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | credit_card |
|---|---|
| 2nd row | car |
| 3rd row | small_business |
| 4th row | other |
| 5th row | other |
| Value | Count | Frequency (%) |
| debt_consolidation | 18641 | |
| credit_card | 5130 | 12.9% |
| other | 3993 | 10.1% |
| home_improvement | 2976 | 7.5% |
| major_purchase | 2187 | 5.5% |
| small_business | 1828 | 4.6% |
| car | 1549 | 3.9% |
| wedding | 947 | 2.4% |
| medical | 693 | 1.7% |
| moving | 583 | 1.5% |
| Other values (4) | 1190 | 3.0% |
| Value | Count | Frequency (%) |
| debt_consolidation | 18641 | |
| credit_card | 5130 | 12.9% |
| other | 3993 | 10.1% |
| home_improvement | 2976 | 7.5% |
| major_purchase | 2187 | 5.5% |
| small_business | 1828 | 4.6% |
| car | 1549 | 3.9% |
| wedding | 947 | 2.4% |
| medical | 693 | 1.7% |
| moving | 583 | 1.5% |
| Other values (4) | 1190 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 69725 | |
| d | 50454 | |
| i | 50145 | |
| t | 50087 | |
| n | 44528 | |
| e | 43568 | 8.0% |
| c | 34036 | 6.2% |
| a | 33730 | 6.2% |
| _ | 30865 | 5.7% |
| s | 28521 | 5.2% |
| Other values (12) | 109901 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 514695 | |
| Connector Punctuation | 30865 | 5.7% |
Most frequent character per category
| Value | Count | Frequency (%) |
| o | 69725 | |
| d | 50454 | |
| i | 50145 | |
| t | 50087 | |
| n | 44528 | |
| e | 43568 | |
| c | 34036 | 6.6% |
| a | 33730 | 6.6% |
| s | 28521 | 5.5% |
| l | 23418 | 4.5% |
| Other values (11) | 86483 |
| Value | Count | Frequency (%) |
| _ | 30865 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 514695 | |
| Common | 30865 | 5.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| o | 69725 | |
| d | 50454 | |
| i | 50145 | |
| t | 50087 | |
| n | 44528 | |
| e | 43568 | |
| c | 34036 | 6.6% |
| a | 33730 | 6.6% |
| s | 28521 | 5.5% |
| l | 23418 | 4.5% |
| Other values (11) | 86483 |
| Value | Count | Frequency (%) |
| _ | 30865 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 545560 |
Most frequent character per block
| Value | Count | Frequency (%) |
| o | 69725 | |
| d | 50454 | |
| i | 50145 | |
| t | 50087 | |
| n | 44528 | |
| e | 43568 | 8.0% |
| c | 34036 | 6.2% |
| a | 33730 | 6.2% |
| _ | 30865 | 5.7% |
| s | 28521 | 5.2% |
| Other values (12) | 109901 |
| Distinct | 19615 |
|---|---|
| Distinct (%) | 49.4% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Memory size | 310.4 KiB |
| Debt Consolidation | 2184 |
|---|---|
| Debt Consolidation Loan | 1729 |
| Personal Loan | 659 |
| Consolidation | 517 |
| debt consolidation | 505 |
| Other values (19610) |
Length
| Max length | 80 |
|---|---|
| Median length | 16 |
| Mean length | 17.18732685 |
| Min length | 1 |
Characters and Unicode
| Total characters | 682440 |
|---|---|
| Distinct characters | 108 |
| Distinct categories | 15 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 17624 ? |
|---|---|
| Unique (%) | 44.4% |
Sample
| 1st row | Computer |
|---|---|
| 2nd row | bike |
| 3rd row | real estate business |
| 4th row | personel |
| 5th row | Personal |
| Value | Count | Frequency (%) |
| Debt Consolidation | 2184 | 5.5% |
| Debt Consolidation Loan | 1729 | 4.4% |
| Personal Loan | 659 | 1.7% |
| Consolidation | 517 | 1.3% |
| debt consolidation | 505 | 1.3% |
| Credit Card Consolidation | 356 | 0.9% |
| Home Improvement | 356 | 0.9% |
| Debt consolidation | 334 | 0.8% |
| Small Business Loan | 328 | 0.8% |
| Credit Card Loan | 317 | 0.8% |
| Other values (19605) | 32421 |
| Value | Count | Frequency (%) |
| loan | 10895 | 10.4% |
| debt | 9245 | 8.8% |
| consolidation | 8622 | 8.2% |
| credit | 4604 | 4.4% |
| card | 3341 | 3.2% |
| personal | 2043 | 2.0% |
| home | 1875 | 1.8% |
| pay | 1344 | 1.3% |
| off | 1259 | 1.2% |
| my | 1133 | 1.1% |
| Other values (8935) | 60203 |
Most occurring characters
| Value | Count | Frequency (%) |
| 66029 | 9.7% | |
| o | 65729 | 9.6% |
| n | 55657 | 8.2% |
| e | 54557 | 8.0% |
| a | 50167 | 7.4% |
| i | 43822 | 6.4% |
| t | 42600 | 6.2% |
| d | 30679 | 4.5% |
| r | 29153 | 4.3% |
| s | 28544 | 4.2% |
| Other values (98) | 215503 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 521300 | |
| Uppercase Letter | 83242 | 12.2% |
| Space Separator | 66029 | 9.7% |
| Decimal Number | 5995 | 0.9% |
| Other Punctuation | 4442 | 0.7% |
| Dash Punctuation | 824 | 0.1% |
| Connector Punctuation | 213 | < 0.1% |
| Close Punctuation | 104 | < 0.1% |
| Currency Symbol | 94 | < 0.1% |
| Math Symbol | 92 | < 0.1% |
| Other values (5) | 105 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| C | 18509 | |
| L | 10335 | |
| D | 9244 | |
| P | 5641 | 6.8% |
| R | 3732 | 4.5% |
| M | 3256 | 3.9% |
| S | 3227 | 3.9% |
| B | 3116 | 3.7% |
| H | 2910 | 3.5% |
| I | 2885 | 3.5% |
| Other values (18) | 20387 |
| Value | Count | Frequency (%) |
| o | 65729 | |
| n | 55657 | |
| e | 54557 | |
| a | 50167 | |
| i | 43822 | |
| t | 42600 | |
| d | 30679 | 5.9% |
| r | 29153 | 5.6% |
| s | 28544 | 5.5% |
| l | 26300 | 5.0% |
| Other values (18) | 94092 |
| Value | Count | Frequency (%) |
| ! | 1123 | |
| ' | 982 | |
| . | 712 | |
| / | 538 | |
| , | 435 | 9.8% |
| & | 328 | 7.4% |
| % | 95 | 2.1% |
| : | 64 | 1.4% |
| " | 56 | 1.3% |
| # | 25 | 0.6% |
| Other values (5) | 84 | 1.9% |
| Value | Count | Frequency (%) |
| 1 | 1691 | |
| 0 | 1677 | |
| 2 | 1105 | |
| 3 | 299 | 5.0% |
| 5 | 256 | 4.3% |
| 9 | 254 | 4.2% |
| 4 | 216 | 3.6% |
| 6 | 178 | 3.0% |
| 8 | 169 | 2.8% |
| 7 | 150 | 2.5% |
| Value | Count | Frequency (%) |
| | 4 | |
| | 4 | |
| | 4 | |
| 2 | ||
| | 2 | |
| | 1 | 5.3% |
| | 1 | 5.3% |
| 1 | 5.3% |
| Value | Count | Frequency (%) |
| + | 53 | |
| = | 19 | 20.7% |
| < | 9 | 9.8% |
| > | 8 | 8.7% |
| ~ | 2 | 2.2% |
| | | 1 | 1.1% |
| Value | Count | Frequency (%) |
| ^ | 1 | |
| ´ | 1 | |
| ` | 1 |
| Value | Count | Frequency (%) |
| ( | 77 | |
| [ | 3 | 3.8% |
| Value | Count | Frequency (%) |
| ) | 100 | |
| ] | 4 | 3.8% |
| Value | Count | Frequency (%) |
| 66029 |
| Value | Count | Frequency (%) |
| - | 824 |
| Value | Count | Frequency (%) |
| _ | 213 |
| Value | Count | Frequency (%) |
| $ | 94 |
| Value | Count | Frequency (%) |
| ³ | 1 |
| Value | Count | Frequency (%) |
| ¦ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 604542 | |
| Common | 77898 | 11.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| o | 65729 | 10.9% |
| n | 55657 | 9.2% |
| e | 54557 | 9.0% |
| a | 50167 | 8.3% |
| i | 43822 | 7.2% |
| t | 42600 | 7.0% |
| d | 30679 | 5.1% |
| r | 29153 | 4.8% |
| s | 28544 | 4.7% |
| l | 26300 | 4.4% |
| Other values (46) | 177334 |
| Value | Count | Frequency (%) |
| 66029 | ||
| 1 | 1691 | 2.2% |
| 0 | 1677 | 2.2% |
| ! | 1123 | 1.4% |
| 2 | 1105 | 1.4% |
| ' | 982 | 1.3% |
| - | 824 | 1.1% |
| . | 712 | 0.9% |
| / | 538 | 0.7% |
| , | 435 | 0.6% |
| Other values (42) | 2782 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 682408 | |
| None | 32 | < 0.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| 66029 | 9.7% | |
| o | 65729 | 9.6% |
| n | 55657 | 8.2% |
| e | 54557 | 8.0% |
| a | 50167 | 7.4% |
| i | 43822 | 6.4% |
| t | 42600 | 6.2% |
| d | 30679 | 4.5% |
| r | 29153 | 4.3% |
| s | 28544 | 4.2% |
| Other values (84) | 215471 |
| Value | Count | Frequency (%) |
| â | 4 | |
| | 4 | |
| î | 4 | |
| | 4 | |
| | 4 | |
| Ã | 2 | |
| | 2 | |
| ¦ | 2 | |
| | 1 | 3.1% |
| | 1 | 3.1% |
| Other values (4) | 4 |
addr_state
Categorical
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| CA | |
|---|---|
| NY | |
| FL | |
| TX | |
| NJ | 1850 |
| Other values (45) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 79434 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AZ |
|---|---|
| 2nd row | GA |
| 3rd row | IL |
| 4th row | CA |
| 5th row | OR |
| Value | Count | Frequency (%) |
| CA | 7099 | |
| NY | 3812 | 9.6% |
| FL | 2866 | 7.2% |
| TX | 2727 | 6.9% |
| NJ | 1850 | 4.7% |
| IL | 1525 | 3.8% |
| PA | 1517 | 3.8% |
| VA | 1407 | 3.5% |
| GA | 1398 | 3.5% |
| MA | 1340 | 3.4% |
| Other values (40) | 14176 |
| Value | Count | Frequency (%) |
| ca | 7099 | |
| ny | 3812 | 9.6% |
| fl | 2866 | 7.2% |
| tx | 2727 | 6.9% |
| nj | 1850 | 4.7% |
| il | 1525 | 3.8% |
| pa | 1517 | 3.8% |
| va | 1407 | 3.5% |
| ga | 1398 | 3.5% |
| ma | 1340 | 3.4% |
| Other values (40) | 14176 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 15698 | |
| C | 10116 | |
| N | 7953 | |
| L | 5279 | 6.6% |
| M | 4706 | 5.9% |
| Y | 4220 | 5.3% |
| T | 3892 | 4.9% |
| O | 3451 | 4.3% |
| I | 3097 | 3.9% |
| F | 2866 | 3.6% |
| Other values (14) | 18156 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 79434 |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 15698 | |
| C | 10116 | |
| N | 7953 | |
| L | 5279 | 6.6% |
| M | 4706 | 5.9% |
| Y | 4220 | 5.3% |
| T | 3892 | 4.9% |
| O | 3451 | 4.3% |
| I | 3097 | 3.9% |
| F | 2866 | 3.6% |
| Other values (14) | 18156 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 79434 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 15698 | |
| C | 10116 | |
| N | 7953 | |
| L | 5279 | 6.6% |
| M | 4706 | 5.9% |
| Y | 4220 | 5.3% |
| T | 3892 | 4.9% |
| O | 3451 | 4.3% |
| I | 3097 | 3.9% |
| F | 2866 | 3.6% |
| Other values (14) | 18156 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79434 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 15698 | |
| C | 10116 | |
| N | 7953 | |
| L | 5279 | 6.6% |
| M | 4706 | 5.9% |
| Y | 4220 | 5.3% |
| T | 3892 | 4.9% |
| O | 3451 | 4.3% |
| I | 3097 | 3.9% |
| F | 2866 | 3.6% |
| Other values (14) | 18156 |
dti
Real number (ℝ≥0)
| Distinct | 2868 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.31512954 |
|---|---|
| Minimum | 0 |
| Maximum | 29.99 |
| Zeros | 183 |
| Zeros (%) | 0.5% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.13 |
| Q1 | 8.17 |
| median | 13.4 |
| Q3 | 18.6 |
| 95-th percentile | 23.84 |
| Maximum | 29.99 |
| Range | 29.99 |
| Interquartile range (IQR) | 10.43 |
Descriptive statistics
| Standard deviation | 6.678593595 |
|---|---|
| Coefficient of variation (CV) | 0.501579318 |
| Kurtosis | -0.8520154806 |
| Mean | 13.31512954 |
| Median Absolute Deviation (MAD) | 5.21 |
| Skewness | -0.02804333095 |
| Sum | 528837 |
| Variance | 44.6036124 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 183 | 0.5% |
| 12 | 51 | 0.1% |
| 18 | 45 | 0.1% |
| 19.2 | 40 | 0.1% |
| 13.2 | 39 | 0.1% |
| 16.8 | 38 | 0.1% |
| 12.48 | 38 | 0.1% |
| 13.5 | 38 | 0.1% |
| 6 | 37 | 0.1% |
| 14.29 | 36 | 0.1% |
| Other values (2858) | 39172 |
| Value | Count | Frequency (%) |
| 0 | 183 | |
| 0.01 | 3 | < 0.1% |
| 0.02 | 5 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 29.99 | 1 | < 0.1% |
| 29.95 | 1 | < 0.1% |
| 29.93 | 3 | |
| 29.92 | 2 | |
| 29.89 | 1 | < 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1465115694 |
|---|---|
| Minimum | 0 |
| Maximum | 11 |
| Zeros | 35405 |
| Zeros (%) | 89.1% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.491811516 |
|---|---|
| Coefficient of variation (CV) | 3.356810102 |
| Kurtosis | 39.41249957 |
| Mean | 0.1465115694 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.022035213 |
| Sum | 5819 |
| Variance | 0.2418785673 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35405 | |
| 1 | 3303 | 8.3% |
| 2 | 687 | 1.7% |
| 3 | 220 | 0.6% |
| 4 | 62 | 0.2% |
| 5 | 22 | 0.1% |
| 6 | 10 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 35405 | |
| 1 | 3303 | 8.3% |
| 2 | 687 | 1.7% |
| 3 | 220 | 0.6% |
| 4 | 62 | 0.2% |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 10 |
| Distinct | 526 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| Nov-98 | 370 |
|---|---|
| Oct-99 | 366 |
| Dec-98 | 348 |
| Oct-00 | 346 |
| Dec-97 | 329 |
| Other values (521) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 238302 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Jan-85 |
|---|---|
| 2nd row | Apr-99 |
| 3rd row | Nov-01 |
| 4th row | Feb-96 |
| 5th row | Jan-96 |
| Value | Count | Frequency (%) |
| Nov-98 | 370 | 0.9% |
| Oct-99 | 366 | 0.9% |
| Dec-98 | 348 | 0.9% |
| Oct-00 | 346 | 0.9% |
| Dec-97 | 329 | 0.8% |
| Nov-00 | 320 | 0.8% |
| Nov-99 | 319 | 0.8% |
| Sep-00 | 306 | 0.8% |
| Oct-98 | 305 | 0.8% |
| Nov-97 | 298 | 0.8% |
| Other values (516) | 36410 |
| Value | Count | Frequency (%) |
| nov-98 | 370 | 0.9% |
| oct-99 | 366 | 0.9% |
| dec-98 | 348 | 0.9% |
| oct-00 | 346 | 0.9% |
| dec-97 | 329 | 0.8% |
| nov-00 | 320 | 0.8% |
| nov-99 | 319 | 0.8% |
| sep-00 | 306 | 0.8% |
| oct-98 | 305 | 0.8% |
| nov-97 | 298 | 0.8% |
| Other values (516) | 36410 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 39717 | |
| 9 | 23353 | 9.8% |
| 0 | 19365 | 8.1% |
| e | 10541 | 4.4% |
| J | 9426 | 4.0% |
| u | 9302 | 3.9% |
| a | 9126 | 3.8% |
| 8 | 8453 | 3.5% |
| c | 8143 | 3.4% |
| n | 6364 | 2.7% |
| Other values (23) | 94512 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 79434 | |
| Decimal Number | 79434 | |
| Uppercase Letter | 39717 | |
| Dash Punctuation | 39717 |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 10541 | |
| u | 9302 | |
| a | 9126 | |
| c | 8143 | |
| n | 6364 | |
| p | 6335 | |
| r | 5536 | |
| t | 4076 | 5.1% |
| o | 3930 | 4.9% |
| v | 3930 | 4.9% |
| Other values (4) | 12151 |
| Value | Count | Frequency (%) |
| 9 | 23353 | |
| 0 | 19365 | |
| 8 | 8453 | 10.6% |
| 7 | 4822 | 6.1% |
| 4 | 4274 | 5.4% |
| 5 | 4201 | 5.3% |
| 6 | 4174 | 5.3% |
| 3 | 3784 | 4.8% |
| 1 | 3736 | 4.7% |
| 2 | 3272 | 4.1% |
| Value | Count | Frequency (%) |
| J | 9426 | |
| A | 6047 | |
| M | 5697 | |
| O | 4076 | |
| D | 4067 | |
| N | 3930 | |
| S | 3593 | 9.0% |
| F | 2881 | 7.3% |
| Value | Count | Frequency (%) |
| - | 39717 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 119151 | |
| Common | 119151 |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 10541 | 8.8% |
| J | 9426 | 7.9% |
| u | 9302 | 7.8% |
| a | 9126 | 7.7% |
| c | 8143 | 6.8% |
| n | 6364 | 5.3% |
| p | 6335 | 5.3% |
| A | 6047 | 5.1% |
| M | 5697 | 4.8% |
| r | 5536 | 4.6% |
| Other values (12) | 42634 |
| Value | Count | Frequency (%) |
| - | 39717 | |
| 9 | 23353 | |
| 0 | 19365 | |
| 8 | 8453 | 7.1% |
| 7 | 4822 | 4.0% |
| 4 | 4274 | 3.6% |
| 5 | 4201 | 3.5% |
| 6 | 4174 | 3.5% |
| 3 | 3784 | 3.2% |
| 1 | 3736 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 238302 |
Most frequent character per block
| Value | Count | Frequency (%) |
| - | 39717 | |
| 9 | 23353 | 9.8% |
| 0 | 19365 | 8.1% |
| e | 10541 | 4.4% |
| J | 9426 | 4.0% |
| u | 9302 | 3.9% |
| a | 9126 | 3.8% |
| 8 | 8453 | 3.5% |
| c | 8143 | 3.4% |
| n | 6364 | 2.7% |
| Other values (23) | 94512 |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8691995871 |
|---|---|
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 19300 |
| Zeros (%) | 48.6% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.070219332 |
|---|---|
| Coefficient of variation (CV) | 1.23126995 |
| Kurtosis | 2.562159858 |
| Mean | 0.8691995871 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.390390927 |
| Sum | 34522 |
| Variance | 1.145369419 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19300 | |
| 1 | 10971 | |
| 2 | 5812 | 14.6% |
| 3 | 3048 | 7.7% |
| 4 | 326 | 0.8% |
| 5 | 146 | 0.4% |
| 6 | 64 | 0.2% |
| 7 | 35 | 0.1% |
| 8 | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 19300 | |
| 1 | 10971 | |
| 2 | 5812 | 14.6% |
| 3 | 3048 | 7.7% |
| 4 | 326 | 0.8% |
| Value | Count | Frequency (%) |
| 8 | 15 | < 0.1% |
| 7 | 35 | 0.1% |
| 6 | 64 | 0.2% |
| 5 | 146 | |
| 4 | 326 |
| Distinct | 95 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 25682 |
| Missing (%) | 64.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.90096188 |
|---|---|
| Minimum | 0 |
| Maximum | 120 |
| Zeros | 443 |
| Zeros (%) | 1.1% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 18 |
| median | 34 |
| Q3 | 52 |
| 95-th percentile | 75 |
| Maximum | 120 |
| Range | 120 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 22.02005955 |
|---|---|
| Coefficient of variation (CV) | 0.6133556984 |
| Kurtosis | -0.8425777778 |
| Mean | 35.90096188 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.3064368727 |
| Sum | 503870 |
| Variance | 484.8830224 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 443 | 1.1% |
| 15 | 252 | 0.6% |
| 23 | 247 | 0.6% |
| 30 | 247 | 0.6% |
| 24 | 241 | 0.6% |
| 19 | 238 | 0.6% |
| 38 | 237 | 0.6% |
| 20 | 233 | 0.6% |
| 22 | 231 | 0.6% |
| 18 | 231 | 0.6% |
| Other values (85) | 11435 | |
| (Missing) | 25682 |
| Value | Count | Frequency (%) |
| 0 | 443 | |
| 1 | 30 | 0.1% |
| 2 | 101 | 0.3% |
| 3 | 145 | 0.4% |
| 4 | 153 | 0.4% |
| Value | Count | Frequency (%) |
| 120 | 1 | |
| 115 | 1 | |
| 107 | 1 | |
| 106 | 1 | |
| 103 | 2 |
open_acc
Real number (ℝ≥0)
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.294407936 |
|---|---|
| Minimum | 2 |
| Maximum | 44 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 9 |
| Q3 | 12 |
| 95-th percentile | 17 |
| Maximum | 44 |
| Range | 42 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.400282474 |
|---|---|
| Coefficient of variation (CV) | 0.4734333272 |
| Kurtosis | 1.67757203 |
| Mean | 9.294407936 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.00376191 |
| Sum | 369146 |
| Variance | 19.36248585 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 4018 | |
| 6 | 3946 | |
| 8 | 3936 | |
| 9 | 3718 | |
| 10 | 3223 | 8.1% |
| 5 | 3183 | 8.0% |
| 11 | 2746 | 6.9% |
| 4 | 2343 | 5.9% |
| 12 | 2273 | 5.7% |
| 13 | 1911 | 4.8% |
| Other values (30) | 8420 |
| Value | Count | Frequency (%) |
| 2 | 605 | 1.5% |
| 3 | 1493 | 3.8% |
| 4 | 2343 | |
| 5 | 3183 | |
| 6 | 3946 |
| Value | Count | Frequency (%) |
| 44 | 1 | |
| 42 | 1 | |
| 41 | 1 | |
| 39 | 1 | |
| 38 | 1 |
pub_rec
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 310.4 KiB |
| 0 | |
|---|---|
| 1 | 2056 |
| 2 | 51 |
| 3 | 7 |
| 4 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 39717 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 39717 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 39717 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39717 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 37601 | |
| 1 | 2056 | 5.2% |
| 2 | 51 | 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 2 | < 0.1% |
| Distinct | 21711 |
|---|---|
| Distinct (%) | 54.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13382.52809 |
|---|---|
| Minimum | 0 |
| Maximum | 149588 |
| Zeros | 994 |
| Zeros (%) | 2.5% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 321.8 |
| Q1 | 3703 |
| median | 8850 |
| Q3 | 17058 |
| 95-th percentile | 41656.4 |
| Maximum | 149588 |
| Range | 149588 |
| Interquartile range (IQR) | 13355 |
Descriptive statistics
| Standard deviation | 15885.01664 |
|---|---|
| Coefficient of variation (CV) | 1.186996697 |
| Kurtosis | 14.89652278 |
| Mean | 13382.52809 |
| Median Absolute Deviation (MAD) | 6027 |
| Skewness | 3.190883683 |
| Sum | 531513868 |
| Variance | 252333753.7 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 994 | 2.5% |
| 255 | 14 | < 0.1% |
| 298 | 14 | < 0.1% |
| 1 | 12 | < 0.1% |
| 682 | 11 | < 0.1% |
| 798 | 9 | < 0.1% |
| 346 | 9 | < 0.1% |
| 10 | 9 | < 0.1% |
| 865 | 9 | < 0.1% |
| 52 | 9 | < 0.1% |
| Other values (21701) | 38627 |
| Value | Count | Frequency (%) |
| 0 | 994 | |
| 1 | 12 | < 0.1% |
| 2 | 5 | < 0.1% |
| 3 | 6 | < 0.1% |
| 4 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 149588 | 1 | |
| 149527 | 1 | |
| 149000 | 1 | |
| 148829 | 1 | |
| 148804 | 1 |
| Distinct | 1089 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 50 |
| Missing (%) | 0.1% |
| Memory size | 310.4 KiB |
| 0% | 977 |
|---|---|
| 0.20% | 63 |
| 63% | 62 |
| 66.70% | 58 |
| 0.10% | 58 |
| Other values (1084) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.521919984 |
| Min length | 2 |
Characters and Unicode
| Total characters | 219038 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 89 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 83.70% |
|---|---|
| 2nd row | 9.40% |
| 3rd row | 98.50% |
| 4th row | 21% |
| 5th row | 53.90% |
| Value | Count | Frequency (%) |
| 0% | 977 | 2.5% |
| 0.20% | 63 | 0.2% |
| 63% | 62 | 0.2% |
| 66.70% | 58 | 0.1% |
| 0.10% | 58 | 0.1% |
| 40.70% | 58 | 0.1% |
| 46.40% | 57 | 0.1% |
| 31.20% | 57 | 0.1% |
| 66.60% | 57 | 0.1% |
| 61% | 57 | 0.1% |
| Other values (1079) | 38163 |
| Value | Count | Frequency (%) |
| 0 | 977 | 2.5% |
| 0.20 | 63 | 0.2% |
| 63 | 62 | 0.2% |
| 40.70 | 58 | 0.1% |
| 0.10 | 58 | 0.1% |
| 66.70 | 58 | 0.1% |
| 66.60 | 57 | 0.1% |
| 61 | 57 | 0.1% |
| 31.20 | 57 | 0.1% |
| 46.40 | 57 | 0.1% |
| Other values (1079) | 38163 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 39671 | |
| % | 39667 | |
| . | 34841 | |
| 4 | 12082 | 5.5% |
| 5 | 12063 | 5.5% |
| 6 | 11989 | 5.5% |
| 7 | 11949 | 5.5% |
| 3 | 11885 | 5.4% |
| 2 | 11550 | 5.3% |
| 8 | 11419 | 5.2% |
| Other values (2) | 21922 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 144530 | |
| Other Punctuation | 74508 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 39671 | |
| 4 | 12082 | 8.4% |
| 5 | 12063 | 8.3% |
| 6 | 11989 | 8.3% |
| 7 | 11949 | 8.3% |
| 3 | 11885 | 8.2% |
| 2 | 11550 | 8.0% |
| 8 | 11419 | 7.9% |
| 1 | 11111 | 7.7% |
| 9 | 10811 | 7.5% |
| Value | Count | Frequency (%) |
| % | 39667 | |
| . | 34841 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 219038 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 39671 | |
| % | 39667 | |
| . | 34841 | |
| 4 | 12082 | 5.5% |
| 5 | 12063 | 5.5% |
| 6 | 11989 | 5.5% |
| 7 | 11949 | 5.5% |
| 3 | 11885 | 5.4% |
| 2 | 11550 | 5.3% |
| 8 | 11419 | 5.2% |
| Other values (2) | 21922 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 219038 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 39671 | |
| % | 39667 | |
| . | 34841 | |
| 4 | 12082 | 5.5% |
| 5 | 12063 | 5.5% |
| 6 | 11989 | 5.5% |
| 7 | 11949 | 5.5% |
| 3 | 11885 | 5.4% |
| 2 | 11550 | 5.3% |
| 8 | 11419 | 5.2% |
| Other values (2) | 21922 |
total_acc
Real number (ℝ≥0)
| Distinct | 82 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.08882846 |
|---|---|
| Minimum | 2 |
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 13 |
| median | 20 |
| Q3 | 29 |
| 95-th percentile | 43 |
| Maximum | 90 |
| Range | 88 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 11.40170855 |
|---|---|
| Coefficient of variation (CV) | 0.5161753405 |
| Kurtosis | 0.6937402027 |
| Mean | 22.08882846 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.8273790855 |
| Sum | 877302 |
| Variance | 129.9989579 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 1471 | 3.7% |
| 15 | 1462 | 3.7% |
| 17 | 1457 | 3.7% |
| 14 | 1445 | 3.6% |
| 20 | 1428 | 3.6% |
| 18 | 1422 | 3.6% |
| 21 | 1412 | 3.6% |
| 13 | 1385 | 3.5% |
| 19 | 1341 | 3.4% |
| 12 | 1325 | 3.3% |
| Other values (72) | 25569 |
| Value | Count | Frequency (%) |
| 2 | 4 | < 0.1% |
| 3 | 182 | 0.5% |
| 4 | 420 | |
| 5 | 552 | |
| 6 | 683 |
| Value | Count | Frequency (%) |
| 90 | 1 | |
| 87 | 1 | |
| 81 | 1 | |
| 80 | 1 | |
| 79 | 2 |
| Distinct | 1137 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.2278873 |
|---|---|
| Minimum | 0 |
| Maximum | 6311.47 |
| Zeros | 38577 |
| Zeros (%) | 97.1% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6311.47 |
| Range | 6311.47 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 375.1728389 |
|---|---|
| Coefficient of variation (CV) | 7.323605532 |
| Kurtosis | 97.6585546 |
| Mean | 51.2278873 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.226730006 |
| Sum | 2034618 |
| Variance | 140754.659 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38577 | |
| 2277.11 | 2 | < 0.1% |
| 2963.24 | 2 | < 0.1% |
| 827.13 | 2 | < 0.1% |
| 1972.6 | 2 | < 0.1% |
| 1202.05 | 1 | < 0.1% |
| 4316.13 | 1 | < 0.1% |
| 3006.67 | 1 | < 0.1% |
| 1725.34 | 1 | < 0.1% |
| 743.52 | 1 | < 0.1% |
| Other values (1127) | 1127 | 2.8% |
| Value | Count | Frequency (%) |
| 0 | 38577 | |
| 10.26 | 1 | < 0.1% |
| 11.91 | 1 | < 0.1% |
| 13.28 | 1 | < 0.1% |
| 19.12 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6311.47 | 1 | |
| 6308.37 | 1 | |
| 6307.37 | 1 | |
| 6307.15 | 1 | |
| 6219.16 | 1 |
| Distinct | 1138 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.98976811 |
|---|---|
| Minimum | 0 |
| Maximum | 6307.37 |
| Zeros | 38577 |
| Zeros (%) | 97.1% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6307.37 |
| Range | 6307.37 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 373.8244569 |
|---|---|
| Coefficient of variation (CV) | 7.331362169 |
| Kurtosis | 98.04055348 |
| Mean | 50.98976811 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.243765495 |
| Sum | 2025160.62 |
| Variance | 139744.7246 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38577 | |
| 1972.6 | 2 | < 0.1% |
| 827.13 | 2 | < 0.1% |
| 1664.64 | 2 | < 0.1% |
| 1212.39 | 1 | < 0.1% |
| 1662.57 | 1 | < 0.1% |
| 3335.41 | 1 | < 0.1% |
| 3131.63 | 1 | < 0.1% |
| 272.65 | 1 | < 0.1% |
| 87.94 | 1 | < 0.1% |
| Other values (1128) | 1128 | 2.8% |
| Value | Count | Frequency (%) |
| 0 | 38577 | |
| 10.26 | 1 | < 0.1% |
| 11.91 | 1 | < 0.1% |
| 13.28 | 1 | < 0.1% |
| 19.09 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6307.37 | 1 | |
| 6306.96 | 1 | |
| 6298.11 | 1 | |
| 6276.75 | 1 | |
| 6219.16 | 1 |
| Distinct | 37850 |
|---|---|
| Distinct (%) | 95.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12153.59654 |
|---|---|
| Minimum | 0 |
| Maximum | 58563.67993 |
| Zeros | 16 |
| Zeros (%) | < 0.1% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1887.957036 |
| Q1 | 5576.93 |
| median | 9899.640319 |
| Q3 | 16534.43304 |
| 95-th percentile | 30245.11853 |
| Maximum | 58563.67993 |
| Range | 58563.67993 |
| Interquartile range (IQR) | 10957.50304 |
Descriptive statistics
| Standard deviation | 9042.040766 |
|---|---|
| Coefficient of variation (CV) | 0.743980659 |
| Kurtosis | 1.985894249 |
| Mean | 12153.59654 |
| Median Absolute Deviation (MAD) | 5016.756711 |
| Skewness | 1.339857366 |
| Sum | 482704393.9 |
| Variance | 81758501.21 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 11196.56943 | 26 | 0.1% |
| 0 | 16 | < 0.1% |
| 10956.77596 | 16 | < 0.1% |
| 11784.23223 | 16 | < 0.1% |
| 5478.387981 | 15 | < 0.1% |
| 13148.13786 | 15 | < 0.1% |
| 5557.025543 | 13 | < 0.1% |
| 13435.90021 | 13 | < 0.1% |
| 13263.95464 | 12 | < 0.1% |
| 14288.76169 | 11 | < 0.1% |
| Other values (37840) | 39564 |
| Value | Count | Frequency (%) |
| 0 | 16 | |
| 33.73 | 1 | < 0.1% |
| 35.71 | 1 | < 0.1% |
| 44.92 | 2 | < 0.1% |
| 44.96 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 58563.67993 | 1 | |
| 58480.13992 | 1 | |
| 57835.27991 | 1 | |
| 56849.26986 | 1 | |
| 56662.58994 | 1 |
| Distinct | 37518 |
|---|---|
| Distinct (%) | 94.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11567.14912 |
|---|---|
| Minimum | 0 |
| Maximum | 58563.68 |
| Zeros | 165 |
| Zeros (%) | 0.4% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1420.408 |
| Q1 | 5112.31 |
| median | 9287.15 |
| Q3 | 15798.81 |
| 95-th percentile | 29627.236 |
| Maximum | 58563.68 |
| Range | 58563.68 |
| Interquartile range (IQR) | 10686.5 |
Descriptive statistics
| Standard deviation | 8942.672613 |
|---|---|
| Coefficient of variation (CV) | 0.7731094777 |
| Kurtosis | 2.029758507 |
| Mean | 11567.14912 |
| Median Absolute Deviation (MAD) | 4939.58 |
| Skewness | 1.35483764 |
| Sum | 459412461.5 |
| Variance | 79971393.47 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 165 | 0.4% |
| 6514.52 | 16 | < 0.1% |
| 5478.39 | 14 | < 0.1% |
| 13148.14 | 14 | < 0.1% |
| 6717.95 | 12 | < 0.1% |
| 10956.78 | 12 | < 0.1% |
| 11196.57 | 12 | < 0.1% |
| 5557.03 | 11 | < 0.1% |
| 7328.92 | 11 | < 0.1% |
| 13517.36 | 11 | < 0.1% |
| Other values (37508) | 39439 |
| Value | Count | Frequency (%) |
| 0 | 165 | |
| 0.54 | 1 | < 0.1% |
| 12.65 | 1 | < 0.1% |
| 18.97 | 1 | < 0.1% |
| 21.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 58563.68 | 1 | |
| 58438.37 | 1 | |
| 57628.73 | 1 | |
| 56622.12 | 1 | |
| 56515.16 | 1 |
| Distinct | 7976 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9793.348813 |
|---|---|
| Minimum | 0 |
| Maximum | 35000.02 |
| Zeros | 74 |
| Zeros (%) | 0.2% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1339.842 |
| Q1 | 4600 |
| median | 8000 |
| Q3 | 13653.26 |
| 95-th percentile | 24999.982 |
| Maximum | 35000.02 |
| Range | 35000.02 |
| Interquartile range (IQR) | 9053.26 |
Descriptive statistics
| Standard deviation | 7065.522127 |
|---|---|
| Coefficient of variation (CV) | 0.7214612961 |
| Kurtosis | 1.103355455 |
| Mean | 9793.348813 |
| Median Absolute Deviation (MAD) | 4000 |
| Skewness | 1.118254546 |
| Sum | 388962434.8 |
| Variance | 49921602.93 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 2293 | 5.8% |
| 12000 | 1805 | 4.5% |
| 5000 | 1702 | 4.3% |
| 6000 | 1637 | 4.1% |
| 15000 | 1400 | 3.5% |
| 8000 | 1318 | 3.3% |
| 20000 | 1059 | 2.7% |
| 4000 | 956 | 2.4% |
| 3000 | 883 | 2.2% |
| 7000 | 851 | 2.1% |
| Other values (7966) | 25813 |
| Value | Count | Frequency (%) |
| 0 | 74 | |
| 21.21 | 1 | < 0.1% |
| 21.93 | 1 | < 0.1% |
| 22.24 | 1 | < 0.1% |
| 22.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000.02 | 2 | < 0.1% |
| 35000.01 | 1 | < 0.1% |
| 35000 | 363 | |
| 34999.99 | 5 | < 0.1% |
| 34999.98 | 1 | < 0.1% |
total_rec_int
Real number (ℝ≥0)
| Distinct | 35148 |
|---|---|
| Distinct (%) | 88.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2263.663172 |
|---|---|
| Minimum | 0 |
| Maximum | 23563.68 |
| Zeros | 71 |
| Zeros (%) | 0.2% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 186.168 |
| Q1 | 662.18 |
| median | 1348.91 |
| Q3 | 2833.4 |
| 95-th percentile | 7575.812 |
| Maximum | 23563.68 |
| Range | 23563.68 |
| Interquartile range (IQR) | 2171.22 |
Descriptive statistics
| Standard deviation | 2608.111964 |
|---|---|
| Coefficient of variation (CV) | 1.152164331 |
| Kurtosis | 9.688278395 |
| Mean | 2263.663172 |
| Median Absolute Deviation (MAD) | 866.01 |
| Skewness | 2.668747187 |
| Sum | 89905910.21 |
| Variance | 6802248.019 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 71 | 0.2% |
| 1196.57 | 26 | 0.1% |
| 514.52 | 19 | < 0.1% |
| 1784.23 | 17 | < 0.1% |
| 717.95 | 17 | < 0.1% |
| 1148.14 | 17 | < 0.1% |
| 956.78 | 17 | < 0.1% |
| 478.39 | 16 | < 0.1% |
| 1907.35 | 14 | < 0.1% |
| 632.21 | 13 | < 0.1% |
| Other values (35138) | 39490 |
| Value | Count | Frequency (%) |
| 0 | 71 | |
| 6.22 | 1 | < 0.1% |
| 6.27 | 1 | < 0.1% |
| 7.19 | 1 | < 0.1% |
| 7.2 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 23563.68 | 1 | |
| 23506.56 | 1 | |
| 23480.14 | 1 | |
| 22835.28 | 1 | |
| 22716.42 | 1 |
| Distinct | 1356 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.363015212 |
|---|---|
| Minimum | 0 |
| Maximum | 180.2 |
| Zeros | 37671 |
| Zeros (%) | 94.8% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 14.924199 |
| Maximum | 180.2 |
| Range | 180.2 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.289979302 |
|---|---|
| Coefficient of variation (CV) | 5.348421085 |
| Kurtosis | 100.8515437 |
| Mean | 1.363015212 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.429536 |
| Sum | 54134.87519 |
| Variance | 53.14379822 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37671 | |
| 15 | 255 | 0.6% |
| 15.00000001 | 58 | 0.1% |
| 30 | 55 | 0.1% |
| 15.00000002 | 47 | 0.1% |
| 14.99999999 | 40 | 0.1% |
| 14.99999998 | 33 | 0.1% |
| 15.00000003 | 32 | 0.1% |
| 15.00000004 | 25 | 0.1% |
| 14.99999997 | 25 | 0.1% |
| Other values (1346) | 1476 | 3.7% |
| Value | Count | Frequency (%) |
| 0 | 37671 | |
| 0.01 | 1 | < 0.1% |
| 0.060799751 | 1 | < 0.1% |
| 0.073787104 | 1 | < 0.1% |
| 0.101704562 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 180.2 | 1 | |
| 166.4297107 | 1 | |
| 165.69 | 1 | |
| 146.6000003 | 1 | |
| 146.04 | 1 |
| Distinct | 4040 |
|---|---|
| Distinct (%) | 10.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95.22162387 |
|---|---|
| Minimum | 0 |
| Maximum | 29623.35 |
| Zeros | 35499 |
| Zeros (%) | 89.4% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 362.418 |
| Maximum | 29623.35 |
| Range | 29623.35 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 688.744771 |
|---|---|
| Coefficient of variation (CV) | 7.23307105 |
| Kurtosis | 379.3775773 |
| Mean | 95.22162387 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.5193782 |
| Sum | 3781917.235 |
| Variance | 474369.3595 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35499 | |
| 10.4 | 4 | < 0.1% |
| 11.29 | 4 | < 0.1% |
| 10.66 | 3 | < 0.1% |
| 13.59 | 3 | < 0.1% |
| 13.93 | 3 | < 0.1% |
| 164.81 | 3 | < 0.1% |
| 19.2 | 3 | < 0.1% |
| 10.07 | 3 | < 0.1% |
| 10.13 | 3 | < 0.1% |
| Other values (4030) | 4189 | 10.5% |
| Value | Count | Frequency (%) |
| 0 | 35499 | |
| 6.3 | 1 | < 0.1% |
| 6.31 | 1 | < 0.1% |
| 8.19 | 1 | < 0.1% |
| 8.36 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 29623.35 | 1 | |
| 22943.37 | 1 | |
| 21810.31 | 1 | |
| 20006.53 | 1 | |
| 19915.67 | 1 |
| Distinct | 2616 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.40611189 |
|---|---|
| Minimum | 0 |
| Maximum | 7002.19 |
| Zeros | 35935 |
| Zeros (%) | 90.5% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 5.152 |
| Maximum | 7002.19 |
| Range | 7002.19 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 148.6715935 |
|---|---|
| Coefficient of variation (CV) | 11.98373791 |
| Kurtosis | 821.3006591 |
| Mean | 12.40611189 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 25.02941842 |
| Sum | 492733.5461 |
| Variance | 22103.2427 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 35935 | |
| 2 | 12 | < 0.1% |
| 1.2 | 10 | < 0.1% |
| 3.71 | 9 | < 0.1% |
| 1.88 | 8 | < 0.1% |
| 0.8 | 8 | < 0.1% |
| 1.69 | 8 | < 0.1% |
| 1.21 | 8 | < 0.1% |
| 2.02 | 8 | < 0.1% |
| 1.6 | 8 | < 0.1% |
| Other values (2606) | 3703 | 9.3% |
| Value | Count | Frequency (%) |
| 0 | 35935 | |
| 0.063 | 1 | < 0.1% |
| 0.074500001 | 1 | < 0.1% |
| 0.134799995 | 1 | < 0.1% |
| 0.1393 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7002.19 | 1 | |
| 6972.59 | 1 | |
| 6543.04 | 1 | |
| 5774.8 | 1 | |
| 5602.72 | 1 |
| Distinct | 101 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 71 |
| Missing (%) | 0.2% |
| Memory size | 310.4 KiB |
| May-16 | 1256 |
|---|---|
| Mar-13 | 1026 |
| Dec-14 | 945 |
| May-13 | 907 |
| Feb-13 | 869 |
| Other values (96) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 237876 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Jan-15 |
|---|---|
| 2nd row | Apr-13 |
| 3rd row | Jun-14 |
| 4th row | Jan-15 |
| 5th row | May-16 |
| Value | Count | Frequency (%) |
| May-16 | 1256 | 3.2% |
| Mar-13 | 1026 | 2.6% |
| Dec-14 | 945 | 2.4% |
| May-13 | 907 | 2.3% |
| Feb-13 | 869 | 2.2% |
| Apr-13 | 851 | 2.1% |
| Mar-12 | 844 | 2.1% |
| Aug-14 | 832 | 2.1% |
| Jan-14 | 832 | 2.1% |
| Aug-12 | 832 | 2.1% |
| Other values (91) | 30452 |
| Value | Count | Frequency (%) |
| may-16 | 1256 | 3.2% |
| mar-13 | 1026 | 2.6% |
| dec-14 | 945 | 2.4% |
| may-13 | 907 | 2.3% |
| feb-13 | 869 | 2.2% |
| apr-13 | 851 | 2.1% |
| mar-12 | 844 | 2.1% |
| aug-12 | 832 | 2.1% |
| jan-14 | 832 | 2.1% |
| aug-14 | 832 | 2.1% |
| Other values (91) | 30452 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 43946 | |
| - | 39646 | |
| a | 11087 | 4.7% |
| e | 9738 | 4.1% |
| 3 | 9458 | 4.0% |
| u | 9401 | 4.0% |
| 4 | 9269 | 3.9% |
| J | 9200 | 3.9% |
| 2 | 8904 | 3.7% |
| M | 8046 | 3.4% |
| Other values (22) | 79181 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 79292 | |
| Decimal Number | 79292 | |
| Uppercase Letter | 39646 | |
| Dash Punctuation | 39646 |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 11087 | |
| e | 9738 | |
| u | 9401 | |
| r | 6965 | |
| c | 6783 | |
| p | 6219 | |
| n | 5974 | |
| y | 4285 | 5.4% |
| t | 3271 | 4.1% |
| g | 3242 | 4.1% |
| Other values (4) | 12327 |
| Value | Count | Frequency (%) |
| 1 | 43946 | |
| 3 | 9458 | 11.9% |
| 4 | 9269 | 11.7% |
| 2 | 8904 | 11.2% |
| 0 | 2544 | 3.2% |
| 5 | 2431 | 3.1% |
| 6 | 2044 | 2.6% |
| 9 | 559 | 0.7% |
| 8 | 137 | 0.2% |
| Value | Count | Frequency (%) |
| J | 9200 | |
| M | 8046 | |
| A | 6446 | |
| D | 3512 | 8.9% |
| O | 3271 | 8.3% |
| F | 3211 | 8.1% |
| S | 3015 | 7.6% |
| N | 2945 | 7.4% |
| Value | Count | Frequency (%) |
| - | 39646 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 118938 | |
| Common | 118938 |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 11087 | 9.3% |
| e | 9738 | 8.2% |
| u | 9401 | 7.9% |
| J | 9200 | 7.7% |
| M | 8046 | 6.8% |
| r | 6965 | 5.9% |
| c | 6783 | 5.7% |
| A | 6446 | 5.4% |
| p | 6219 | 5.2% |
| n | 5974 | 5.0% |
| Other values (12) | 39079 |
| Value | Count | Frequency (%) |
| 1 | 43946 | |
| - | 39646 | |
| 3 | 9458 | 8.0% |
| 4 | 9269 | 7.8% |
| 2 | 8904 | 7.5% |
| 0 | 2544 | 2.1% |
| 5 | 2431 | 2.0% |
| 6 | 2044 | 1.7% |
| 9 | 559 | 0.5% |
| 8 | 137 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 237876 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 43946 | |
| - | 39646 | |
| a | 11087 | 4.7% |
| e | 9738 | 4.1% |
| 3 | 9458 | 4.0% |
| u | 9401 | 4.0% |
| 4 | 9269 | 3.9% |
| J | 9200 | 3.9% |
| 2 | 8904 | 3.7% |
| M | 8046 | 3.4% |
| Other values (22) | 79181 |
last_pymnt_amnt
Real number (ℝ≥0)
| Distinct | 34930 |
|---|---|
| Distinct (%) | 87.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2678.826162 |
|---|---|
| Minimum | 0 |
| Maximum | 36115.2 |
| Zeros | 74 |
| Zeros (%) | 0.2% |
| Memory size | 310.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 43.34 |
| Q1 | 218.68 |
| median | 546.14 |
| Q3 | 3293.16 |
| 95-th percentile | 12183.944 |
| Maximum | 36115.2 |
| Range | 36115.2 |
| Interquartile range (IQR) | 3074.48 |
Descriptive statistics
| Standard deviation | 4447.136012 |
|---|---|
| Coefficient of variation (CV) | 1.660106234 |
| Kurtosis | 8.867819694 |
| Mean | 2678.826162 |
| Median Absolute Deviation (MAD) | 449.45 |
| Skewness | 2.712122241 |
| Sum | 106394938.7 |
| Variance | 19777018.71 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 74 | 0.2% |
| 276.06 | 21 | 0.1% |
| 200 | 17 | < 0.1% |
| 50 | 16 | < 0.1% |
| 100 | 15 | < 0.1% |
| 400 | 12 | < 0.1% |
| 773.44 | 12 | < 0.1% |
| 150 | 11 | < 0.1% |
| 786.01 | 11 | < 0.1% |
| 500 | 11 | < 0.1% |
| Other values (34920) | 39517 |
| Value | Count | Frequency (%) |
| 0 | 74 | |
| 0.01 | 1 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.03 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 36115.2 | 1 | |
| 35613.68 | 1 | |
| 35596.41 | 1 | |
| 35479.89 | 1 | |
| 35471.86 | 1 |
| Distinct | 106 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 310.4 KiB |
| May-16 | |
|---|---|
| Apr-16 | |
| Mar-16 | 1123 |
| Feb-13 | 843 |
| Feb-16 | 736 |
| Other values (101) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 238290 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | May-16 |
|---|---|
| 2nd row | Sep-13 |
| 3rd row | May-16 |
| 4th row | Apr-16 |
| 5th row | May-16 |
| Value | Count | Frequency (%) |
| May-16 | 10308 | |
| Apr-16 | 2547 | 6.4% |
| Mar-16 | 1123 | 2.8% |
| Feb-13 | 843 | 2.1% |
| Feb-16 | 736 | 1.9% |
| Jan-16 | 657 | 1.7% |
| Dec-15 | 647 | 1.6% |
| Mar-13 | 577 | 1.5% |
| Mar-14 | 564 | 1.4% |
| Dec-14 | 562 | 1.4% |
| Other values (96) | 21151 |
| Value | Count | Frequency (%) |
| may-16 | 10308 | |
| apr-16 | 2547 | 6.4% |
| mar-16 | 1123 | 2.8% |
| feb-13 | 843 | 2.1% |
| feb-16 | 736 | 1.9% |
| jan-16 | 657 | 1.7% |
| dec-15 | 647 | 1.6% |
| mar-13 | 577 | 1.5% |
| mar-14 | 564 | 1.4% |
| dec-14 | 562 | 1.4% |
| Other values (96) | 21151 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 41601 | |
| - | 39715 | |
| a | 17601 | 7.4% |
| M | 15523 | 6.5% |
| 6 | 15371 | 6.5% |
| y | 12231 | 5.1% |
| r | 7664 | 3.2% |
| e | 7600 | 3.2% |
| p | 6483 | 2.7% |
| A | 6411 | 2.7% |
| Other values (23) | 68090 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 79430 | |
| Decimal Number | 79430 | |
| Uppercase Letter | 39715 | |
| Dash Punctuation | 39715 |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 17601 | |
| y | 12231 | |
| r | 7664 | |
| e | 7600 | |
| p | 6483 | 8.2% |
| u | 5856 | 7.4% |
| c | 4475 | 5.6% |
| n | 3834 | 4.8% |
| b | 3075 | 3.9% |
| o | 2225 | 2.8% |
| Other values (4) | 8386 |
| Value | Count | Frequency (%) |
| 1 | 41601 | |
| 6 | 15371 | 19.4% |
| 4 | 6255 | 7.9% |
| 5 | 5502 | 6.9% |
| 3 | 5164 | 6.5% |
| 2 | 4079 | 5.1% |
| 0 | 1153 | 1.5% |
| 9 | 228 | 0.3% |
| 8 | 41 | 0.1% |
| 7 | 36 | < 0.1% |
| Value | Count | Frequency (%) |
| M | 15523 | |
| A | 6411 | |
| J | 5895 | 14.8% |
| F | 3075 | 7.7% |
| D | 2414 | 6.1% |
| N | 2225 | 5.6% |
| S | 2111 | 5.3% |
| O | 2061 | 5.2% |
| Value | Count | Frequency (%) |
| - | 39715 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 119145 | |
| Common | 119145 |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 17601 | |
| M | 15523 | |
| y | 12231 | |
| r | 7664 | 6.4% |
| e | 7600 | 6.4% |
| p | 6483 | 5.4% |
| A | 6411 | 5.4% |
| J | 5895 | 4.9% |
| u | 5856 | 4.9% |
| c | 4475 | 3.8% |
| Other values (12) | 29406 |
| Value | Count | Frequency (%) |
| 1 | 41601 | |
| - | 39715 | |
| 6 | 15371 | 12.9% |
| 4 | 6255 | 5.2% |
| 5 | 5502 | 4.6% |
| 3 | 5164 | 4.3% |
| 2 | 4079 | 3.4% |
| 0 | 1153 | 1.0% |
| 9 | 228 | 0.2% |
| 8 | 41 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 238290 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 41601 | |
| - | 39715 | |
| a | 17601 | 7.4% |
| M | 15523 | 6.5% |
| 6 | 15371 | 6.5% |
| y | 12231 | 5.1% |
| r | 7664 | 3.2% |
| e | 7600 | 3.2% |
| p | 6483 | 2.7% |
| A | 6411 | 2.7% |
| Other values (23) | 68090 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 697 |
| Missing (%) | 1.8% |
| Memory size | 310.4 KiB |
| 0.0 | |
|---|---|
| 1.0 | 1674 |
| 2.0 | 7 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 117060 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 37339 | |
| 1.0 | 1674 | 4.2% |
| 2.0 | 7 | < 0.1% |
| (Missing) | 697 | 1.8% |
| Value | Count | Frequency (%) |
| 0.0 | 37339 | |
| 1.0 | 1674 | 4.3% |
| 2.0 | 7 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 76359 | |
| . | 39020 | |
| 1 | 1674 | 1.4% |
| 2 | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 78040 | |
| Other Punctuation | 39020 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 76359 | |
| 1 | 1674 | 2.1% |
| 2 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| . | 39020 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 117060 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 76359 | |
| . | 39020 | |
| 1 | 1674 | 1.4% |
| 2 | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117060 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 76359 | |
| . | 39020 | |
| 1 | 1674 | 1.4% |
| 2 | 7 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| loan_amnt | funded_amnt | funded_amnt_inv | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | purpose | title | addr_state | dti | delinq_2yrs | earliest_cr_line | inq_last_6mths | mths_since_last_delinq | open_acc | pub_rec | revol_bal | revol_util | total_acc | out_prncp | out_prncp_inv | total_pymnt | total_pymnt_inv | total_rec_prncp | total_rec_int | total_rec_late_fee | recoveries | collection_recovery_fee | last_pymnt_d | last_pymnt_amnt | last_credit_pull_d | pub_rec_bankruptcies | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5000 | 5000 | 4,975.00 | 36 months | 10.65% | 162.87 | B | B2 | NaN | 10+ years | RENT | 24,000.00 | Verified | Dec-11 | Fully Paid | credit_card | Computer | AZ | 27.65 | 0 | Jan-85 | 1 | NaN | 3 | 0 | 13648 | 83.70% | 9 | 0.00 | 0.00 | 5,863.16 | 5,833.84 | 5,000.00 | 863.16 | 0.00 | 0.00 | 0.00 | Jan-15 | 171.62 | May-16 | 0.00 |
| 1 | 2500 | 2500 | 2,500.00 | 60 months | 15.27% | 59.83 | C | C4 | Ryder | < 1 year | RENT | 30,000.00 | Source Verified | Dec-11 | Charged Off | car | bike | GA | 1.00 | 0 | Apr-99 | 5 | NaN | 3 | 0 | 1687 | 9.40% | 4 | 0.00 | 0.00 | 1,008.71 | 1,008.71 | 456.46 | 435.17 | 0.00 | 117.08 | 1.11 | Apr-13 | 119.66 | Sep-13 | 0.00 |
| 2 | 2400 | 2400 | 2,400.00 | 36 months | 15.96% | 84.33 | C | C5 | NaN | 10+ years | RENT | 12,252.00 | Not Verified | Dec-11 | Fully Paid | small_business | real estate business | IL | 8.72 | 0 | Nov-01 | 2 | NaN | 2 | 0 | 2956 | 98.50% | 10 | 0.00 | 0.00 | 3,005.67 | 3,005.67 | 2,400.00 | 605.67 | 0.00 | 0.00 | 0.00 | Jun-14 | 649.91 | May-16 | 0.00 |
| 3 | 10000 | 10000 | 10,000.00 | 36 months | 13.49% | 339.31 | C | C1 | AIR RESOURCES BOARD | 10+ years | RENT | 49,200.00 | Source Verified | Dec-11 | Fully Paid | other | personel | CA | 20.00 | 0 | Feb-96 | 1 | 35.00 | 10 | 0 | 5598 | 21% | 37 | 0.00 | 0.00 | 12,231.89 | 12,231.89 | 10,000.00 | 2,214.92 | 16.97 | 0.00 | 0.00 | Jan-15 | 357.48 | Apr-16 | 0.00 |
| 4 | 3000 | 3000 | 3,000.00 | 60 months | 12.69% | 67.79 | B | B5 | University Medical Group | 1 year | RENT | 80,000.00 | Source Verified | Dec-11 | Current | other | Personal | OR | 17.94 | 0 | Jan-96 | 0 | 38.00 | 15 | 0 | 27783 | 53.90% | 38 | 524.06 | 524.06 | 3,513.33 | 3,513.33 | 2,475.94 | 1,037.39 | 0.00 | 0.00 | 0.00 | May-16 | 67.79 | May-16 | 0.00 |
| 5 | 5000 | 5000 | 5,000.00 | 36 months | 7.90% | 156.46 | A | A4 | Veolia Transportaton | 3 years | RENT | 36,000.00 | Source Verified | Dec-11 | Fully Paid | wedding | My wedding loan I promise to pay back | AZ | 11.20 | 0 | Nov-04 | 3 | NaN | 9 | 0 | 7963 | 28.30% | 12 | 0.00 | 0.00 | 5,632.21 | 5,632.21 | 5,000.00 | 632.21 | 0.00 | 0.00 | 0.00 | Jan-15 | 161.03 | Jan-16 | 0.00 |
| 6 | 7000 | 7000 | 7,000.00 | 60 months | 15.96% | 170.08 | C | C5 | Southern Star Photography | 8 years | RENT | 47,004.00 | Not Verified | Dec-11 | Fully Paid | debt_consolidation | Loan | NC | 23.51 | 0 | Jul-05 | 1 | NaN | 7 | 0 | 17726 | 85.60% | 11 | 0.00 | 0.00 | 10,110.84 | 10,110.84 | 6,985.61 | 3,125.23 | 0.00 | 0.00 | 0.00 | May-16 | 1,313.76 | May-16 | 0.00 |
| 7 | 3000 | 3000 | 3,000.00 | 36 months | 18.64% | 109.43 | E | E1 | MKC Accounting | 9 years | RENT | 48,000.00 | Source Verified | Dec-11 | Fully Paid | car | Car Downpayment | CA | 5.35 | 0 | Jan-07 | 2 | NaN | 4 | 0 | 8221 | 87.50% | 4 | 0.00 | 0.00 | 3,939.14 | 3,939.14 | 3,000.00 | 939.14 | 0.00 | 0.00 | 0.00 | Jan-15 | 111.34 | Dec-14 | 0.00 |
| 8 | 5600 | 5600 | 5,600.00 | 60 months | 21.28% | 152.39 | F | F2 | NaN | 4 years | OWN | 40,000.00 | Source Verified | Dec-11 | Charged Off | small_business | Expand Business & Buy Debt Portfolio | CA | 5.55 | 0 | Apr-04 | 2 | NaN | 11 | 0 | 5210 | 32.60% | 13 | 0.00 | 0.00 | 646.02 | 646.02 | 162.02 | 294.94 | 0.00 | 189.06 | 2.09 | Apr-12 | 152.39 | Aug-12 | 0.00 |
| 9 | 5375 | 5375 | 5,350.00 | 60 months | 12.69% | 121.45 | B | B5 | Starbucks | < 1 year | RENT | 15,000.00 | Verified | Dec-11 | Charged Off | other | Building my credit history. | TX | 18.08 | 0 | Sep-04 | 0 | NaN | 2 | 0 | 9279 | 36.50% | 3 | 0.00 | 0.00 | 1,476.19 | 1,469.34 | 673.48 | 533.42 | 0.00 | 269.29 | 2.52 | Nov-12 | 121.45 | Mar-13 | 0.00 |
Last rows
| loan_amnt | funded_amnt | funded_amnt_inv | term | int_rate | installment | grade | sub_grade | emp_title | emp_length | home_ownership | annual_inc | verification_status | issue_d | loan_status | purpose | title | addr_state | dti | delinq_2yrs | earliest_cr_line | inq_last_6mths | mths_since_last_delinq | open_acc | pub_rec | revol_bal | revol_util | total_acc | out_prncp | out_prncp_inv | total_pymnt | total_pymnt_inv | total_rec_prncp | total_rec_int | total_rec_late_fee | recoveries | collection_recovery_fee | last_pymnt_d | last_pymnt_amnt | last_credit_pull_d | pub_rec_bankruptcies | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 39707 | 5000 | 5000 | 525.00 | 36 months | 9.33% | 159.77 | B | B3 | Stark and Roth Inc | 2 years | MORTGAGE | 180,000.00 | Not Verified | Jul-07 | Fully Paid | home_improvement | home improvment loan | WI | 11.93 | 0 | Feb-95 | 1 | 0.00 | 16 | 0 | 60568 | 39.20% | 38 | 0.00 | 0.00 | 5,751.53 | 603.91 | 5,000.00 | 751.53 | 0.00 | 0.00 | 0.00 | Jul-10 | 161.55 | Jun-07 | NaN |
| 39708 | 5000 | 5000 | 375.00 | 36 months | 9.96% | 161.25 | B | B5 | Millenium Group | 4 years | MORTGAGE | 48,000.00 | Not Verified | Jul-07 | Fully Paid | debt_consolidation | Tito5000 | FL | 8.03 | 0 | Aug-95 | 1 | 0.00 | 6 | 0 | 28329 | 48.60% | 6 | 0.00 | 0.00 | 5,804.73 | 435.36 | 5,000.00 | 804.73 | 0.00 | 0.00 | 0.00 | Jul-10 | 162.07 | Jun-10 | NaN |
| 39709 | 5000 | 5000 | 675.00 | 36 months | 11.22% | 164.23 | C | C4 | Self-Employeed | < 1 year | OWN | 80,000.00 | Not Verified | Jul-07 | Fully Paid | credit_card | P's Family Credit Loan | WI | 1.21 | 0 | Jul-96 | 3 | 0.00 | 15 | 1 | 27185 | 16.10% | 29 | 0.00 | 0.00 | 5,912.05 | 798.13 | 5,000.00 | 912.05 | 0.00 | 0.00 | 0.00 | Jul-10 | 165.17 | Jun-07 | NaN |
| 39710 | 5000 | 5000 | 250.00 | 36 months | 7.43% | 155.38 | A | A2 | Rush Univ Med Grp | 1 year | OWN | 85,000.00 | Not Verified | Jul-07 | Fully Paid | credit_card | My Credit Card Loan | WI | 0.31 | 0 | Oct-97 | 0 | 0.00 | 7 | 0 | 216 | 0.60% | 19 | 0.00 | 0.00 | 5,593.63 | 279.68 | 5,000.00 | 593.63 | 0.00 | 0.00 | 0.00 | Jul-10 | 156.29 | Jun-07 | NaN |
| 39711 | 5000 | 5000 | 700.00 | 36 months | 8.70% | 158.30 | B | B1 | A. F. Wolfers, Inc. | 5 years | MORTGAGE | 75,000.00 | Not Verified | Jul-07 | Fully Paid | credit_card | Reduce Credit Card Debt | CO | 15.55 | 0 | May-94 | 0 | 0.00 | 10 | 0 | 66033 | 23% | 29 | 0.00 | 0.00 | 5,698.60 | 797.80 | 5,000.00 | 698.60 | 0.00 | 0.00 | 0.00 | Jul-10 | 159.83 | Nov-14 | NaN |
| 39712 | 2500 | 2500 | 1,075.00 | 36 months | 8.07% | 78.42 | A | A4 | FiSite Research | 4 years | MORTGAGE | 110,000.00 | Not Verified | Jul-07 | Fully Paid | home_improvement | Home Improvement | CO | 11.33 | 0 | Nov-90 | 0 | 0.00 | 13 | 0 | 7274 | 13.10% | 40 | 0.00 | 0.00 | 2,822.97 | 1,213.88 | 2,500.00 | 322.97 | 0.00 | 0.00 | 0.00 | Jul-10 | 80.90 | Jun-10 | NaN |
| 39713 | 8500 | 8500 | 875.00 | 36 months | 10.28% | 275.38 | C | C1 | Squarewave Solutions, Ltd. | 3 years | RENT | 18,000.00 | Not Verified | Jul-07 | Fully Paid | credit_card | Retiring credit card debt | NC | 6.40 | 1 | Dec-86 | 1 | 5.00 | 6 | 0 | 8847 | 26.90% | 9 | 0.00 | 0.00 | 9,913.49 | 1,020.51 | 8,500.00 | 1,413.49 | 0.00 | 0.00 | 0.00 | Jul-10 | 281.94 | Jul-10 | NaN |
| 39714 | 5000 | 5000 | 1,325.00 | 36 months | 8.07% | 156.84 | A | A4 | NaN | < 1 year | MORTGAGE | 100,000.00 | Not Verified | Jul-07 | Fully Paid | debt_consolidation | MBA Loan Consolidation | MA | 2.30 | 0 | Oct-98 | 0 | 0.00 | 11 | 0 | 9698 | 19.40% | 20 | 0.00 | 0.00 | 5,272.16 | 1,397.12 | 5,000.00 | 272.16 | 0.00 | 0.00 | 0.00 | Apr-08 | 0.00 | Jun-07 | NaN |
| 39715 | 5000 | 5000 | 650.00 | 36 months | 7.43% | 155.38 | A | A2 | NaN | < 1 year | MORTGAGE | 200,000.00 | Not Verified | Jul-07 | Fully Paid | other | JAL Loan | MD | 3.72 | 0 | Nov-88 | 0 | 0.00 | 17 | 0 | 85607 | 0.70% | 26 | 0.00 | 0.00 | 5,174.20 | 672.66 | 5,000.00 | 174.20 | 0.00 | 0.00 | 0.00 | Jan-08 | 0.00 | Jun-07 | NaN |
| 39716 | 7500 | 7500 | 800.00 | 36 months | 13.75% | 255.43 | E | E2 | Evergreen Center | < 1 year | OWN | 22,000.00 | Not Verified | Jun-07 | Fully Paid | debt_consolidation | Consolidation Loan | MA | 14.29 | 1 | Oct-03 | 0 | 11.00 | 7 | 0 | 4175 | 51.50% | 8 | 0.00 | 0.00 | 9,195.26 | 980.83 | 7,500.00 | 1,695.26 | 0.00 | 0.00 | 0.00 | Jun-10 | 256.59 | Jun-10 | NaN |